STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 16 days ago • 52
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors Paper • 2412.11586 • Published Dec 16, 2024 • 11
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors Paper • 2412.11586 • Published Dec 16, 2024 • 11
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Paper • 2412.09283 • Published Dec 12, 2024 • 19
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper • 2411.06558 • Published Nov 10, 2024 • 34
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper • 2407.02371 • Published Jul 2, 2024 • 51
NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results Paper • 2005.01996 • Published May 5, 2020
Learning Versatile 3D Shape Generation with Improved AR Models Paper • 2303.14700 • Published Mar 26, 2023
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization Paper • 2312.06354 • Published Dec 11, 2023 • 1
DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation Paper • 2403.17664 • Published Mar 26, 2024
Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting Paper • 2404.18598 • Published Apr 29, 2024
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network Paper • 2406.18284 • Published Jun 26, 2024 • 19
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability Paper • 2402.12225 • Published Feb 19, 2024 • 8
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability Paper • 2402.12225 • Published Feb 19, 2024 • 8
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Paper • 2402.00769 • Published Feb 1, 2024 • 22
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling Paper • 2401.15977 • Published Jan 29, 2024 • 37
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation Paper • 2312.09251 • Published Dec 14, 2023 • 7
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks Paper • 2311.09835 • Published Nov 16, 2023 • 10