view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 4 days ago • 18
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 112
UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks Paper • 2407.02158 • Published Jul 2, 2024 • 1
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 16 days ago • 30
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published Nov 12, 2024 • 27
OLMo 2 Collection Artifacts for the second set of OLMo models. • 22 items • Updated about 22 hours ago • 72
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 25 days ago • 136
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published 25 days ago • 32
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3, 2024 • 91
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated about 22 hours ago • 292
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Paper • 2411.08033 • Published Nov 12, 2024 • 22
VoxPopuli v2 Collection A collection of checkpoints from the second VoxPopuli release. • 35 items • Updated Jan 16, 2024 • 5
VoxPopuli Collection A collection of open-source artefacts (datasets + checkpoints) from the first VoxPopuli release. • 32 items • Updated Jan 16, 2024 • 4
Robust Wav2Vec 2.0 Collection A collection of "robust" Wav2Vec 2.0 checkpoints pre-trained on datasets from multiple domains. • 4 items • Updated Jan 16, 2024 • 3
XLSR Collection A collection of multilingual Wav2Vec 2.0 checkpoints pre-trained on 53 languages and fine-tuned for CTC speech recognition. • 12 items • Updated Jan 16, 2024 • 6
Wav2Vec 2.0 Collection A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data. • 8 items • Updated Jan 16, 2024 • 18