VidTok: A Versatile and Open-Source Video Tokenizer Paper • 2412.13061 • Published 15 days ago • 8 • 2
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement Paper • 2406.08096 • Published Jun 12, 2024
IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI Paper • 2411.00785 • Published Oct 17, 2024 • 8
Memories are One-to-Many Mapping Alleviators in Talking Face Generation Paper • 2212.05005 • Published Dec 9, 2022
End-to-End Rate-Distortion Optimized 3D Gaussian Representation Paper • 2406.01597 • Published Apr 9, 2024
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder Paper • 2303.17550 • Published Mar 30, 2023
Compositional 3D-aware Video Generation with LLM Director Paper • 2409.00558 • Published Aug 31, 2024 • 14