Sherrie Walton's picture

29 3

Sherrie Walton

sherriew

AI & ML interests

Voice Assistants (e.g. Siri and Alexa)

Organizations

None yet

sherriew's activity

upvoted 20 papers 5 months ago

AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents

Paper • 2407.17490 • Published Jul 3, 2024 • 31

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 30

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Paper • 2407.18907 • Published Jul 26, 2024 • 41

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 32

Floating No More: Object-Ground Reconstruction from a Single Image

Paper • 2407.18914 • Published Jul 26, 2024 • 20

VSSD: Vision Mamba with Non-Casual State Space Duality

Paper • 2407.18559 • Published Jul 26, 2024 • 19

Lessons from Learning to Spin "Pens"

Paper • 2407.18902 • Published Jul 26, 2024 • 21

Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings

Paper • 2407.20581 • Published Jul 30, 2024 • 23

Meltemi: The first open Large Language Model for Greek

Paper • 2407.20743 • Published Jul 30, 2024 • 67

Futga: Towards Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation

Paper • 2407.20445 • Published Jul 29, 2024 • 20

A Large Encoder-Decoder Family of Foundation Models For Chemical Language

Paper • 2407.20267 • Published Jul 24, 2024 • 31

Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification

Paper • 2407.19340 • Published Jul 27, 2024 • 58

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Paper • 2407.18961 • Published Jul 18, 2024 • 40

WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds

Paper • 2407.18946 • Published Jul 11, 2024 • 12

TAPTRv2: Attention-based Position Update Improves Tracking Any Point

Paper • 2407.16291 • Published Jul 23, 2024 • 11

CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting

Paper • 2401.18075 • Published Jan 31, 2024 • 8

Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion

Paper • 2401.17583 • Published Jan 31, 2024 • 25

RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Paper • 2401.18059 • Published Jan 31, 2024 • 36

TextCraftor: Your Text Encoder Can be Image Quality Controller

Paper • 2403.18978 • Published Mar 27, 2024 • 13

Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation

Paper • 2403.19319 • Published Mar 28, 2024 • 12