-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28 -
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 1.17M • • 7.75k -
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text • Updated • 1.6M • 1.01k -
zer0int/CLIP-GmP-ViT-L-14
Zero-Shot Image Classification • Updated • 4.98k • 362
Collections
Discover the best community collections!
Collections including paper arxiv:2401.01808
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 2 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution
Paper • 2401.00935 • Published • 17 -
Taming Mode Collapse in Score Distillation for Text-to-3D Generation
Paper • 2401.00909 • Published • 9 -
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image
Paper • 2401.01117 • Published • 8 -
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Paper • 2401.01173 • Published • 11
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 27 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper • 2401.00604 • Published • 4 -
LARP: Language-Agent Role Play for Open-World Games
Paper • 2312.17653 • Published • 31
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28 -
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Paper • 2401.05252 • Published • 47 -
Scalable Pre-training of Large Autoregressive Image Models
Paper • 2401.08541 • Published • 36 -
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Paper • 2401.09417 • Published • 59
-
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper • 2312.02155 • Published • 12 -
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper • 2312.02928 • Published • 16 -
FaceStudio: Put Your Face Everywhere in Seconds
Paper • 2312.02663 • Published • 30 -
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28