Image / Video Gen Collection Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 32 items • Updated 1 day ago • 6
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 13 days ago • 23
Multimodal Language Model Collection What matters besides the data recipe when training a multimodal language model? • 28 items • Updated 15 days ago • 1
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published about 1 month ago • 123
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 23 days ago • 136
STIV: Scalable Text and Image Conditioned Video Generation Paper • 2412.07730 • Published 26 days ago • 70
Open Datasets Collection Thank you for sharing your datasets. I’ve fed them to my model, and they have benefited it. • 15 items • Updated 29 days ago
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 121