DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation Paper • 2411.16657 • Published Nov 25, 2024 • 17
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Paper • 2411.15115 • Published Nov 22, 2024 • 9
Glider: Global and Local Instruction-Driven Expert Router Paper • 2410.07172 • Published Oct 9, 2024
Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection Paper • 2410.10636 • Published Oct 14, 2024
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Paper • 2410.12761 • Published Oct 16, 2024
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models Paper • 2304.01515 • Published Apr 4, 2023
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models Paper • 2310.00754 • Published Oct 1, 2023
On the Soft-Subnetwork for Few-shot Class Incremental Learning Paper • 2209.07529 • Published Sep 15, 2022 • 1
Forget-free Continual Learning with Soft-Winning SubNetworks Paper • 2303.14962 • Published Mar 27, 2023 • 1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences Paper • 2401.10529 • Published Jan 19, 2024 • 1
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents Paper • 2403.12014 • Published Mar 18, 2024
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models Paper • 2310.02998 • Published Oct 4, 2023 • 1
Progressive Fourier Neural Representation for Sequential Video Compilation Paper • 2306.11305 • Published Jun 20, 2023
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data Paper • 2403.06952 • Published Mar 11, 2024
RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives Paper • 2405.18406 • Published May 28, 2024 • 1
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens Paper • 2211.10636 • Published Nov 19, 2022