gary109
's Collections
Representations
updated
Natural Language Supervision for General-Purpose Audio Representations
Paper
•
2309.05767
•
Published
•
9
AudioSR: Versatile Audio Super-resolution at Scale
Paper
•
2309.07314
•
Published
•
25
FoleyGen: Visually-Guided Audio Generation
Paper
•
2309.10537
•
Published
•
8
Toward Joint Language Modeling for Speech Units and Text
Paper
•
2310.08715
•
Published
•
7
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
Paper
•
2310.00704
•
Published
•
21
Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D
Scene Representations
Paper
•
2310.17880
•
Published
•
7
Harvesting Textual and Structured Data from the HAL Publication
Repository
Paper
•
2407.20595
•
Published
•
21
Open-Vocabulary Audio-Visual Semantic Segmentation
Paper
•
2407.21721
•
Published
•
8
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation
Learning for Neural Radiance Fields
Paper
•
2404.01300
•
Published
•
4
UniT: Unified Tactile Representation for Robot Learning
Paper
•
2408.06481
•
Published
•
9
SlotLifter: Slot-guided Feature Lifting for Learning Object-centric
Radiance Fields
Paper
•
2408.06697
•
Published
•
14