Kyle's picture

Kyle PRO

iky1e

·

https://ikyle.me

kylehowells

AI & ML interests

None yet

Recent Activity

liked a model about 13 hours ago

VITA-MLLM/VITA-1.5

liked a model about 13 hours ago

VITA-MLLM/Long-VITA-1M

liked a model about 13 hours ago

VITA-MLLM/Long-VITA-128K

View all activity

Organizations

None yet

iky1e's activity

upvoted an article 3 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

By

•

4 days ago

• 18

upvoted a paper 14 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 112

upvoted a paper 15 days ago

UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Paper • 2407.02158 • Published Jul 2, 2024 • 1

upvoted a collection 18 days ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 16 days ago • 30

upvoted a paper 18 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 19 days ago • 338

upvoted a paper 19 days ago

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 27

upvoted a collection 19 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated about 22 hours ago • 72

upvoted a paper 20 days ago

Wonderland: Navigating 3D Scenes from a Single Image

Paper • 2412.12091 • Published 22 days ago • 15

upvoted a collection 21 days ago

VisionLM

596 items • Updated 1 day ago • 39

upvoted 3 papers 23 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 25 days ago • 136

Large Action Models: From Inception to Implementation

Paper • 2412.10047 • Published 25 days ago • 32

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 26 days ago • 87

upvoted a collection 23 days ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3, 2024 • 91

upvoted a collection 24 days ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated about 22 hours ago • 292

upvoted a paper 24 days ago

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Paper • 2411.08033 • Published Nov 12, 2024 • 22

upvoted 5 collections 24 days ago

VoxPopuli v2

A collection of checkpoints from the second VoxPopuli release. • 35 items • Updated Jan 16, 2024 • 5

VoxPopuli

A collection of open-source artefacts (datasets + checkpoints) from the first VoxPopuli release. • 32 items • Updated Jan 16, 2024 • 4

Robust Wav2Vec 2.0

A collection of "robust" Wav2Vec 2.0 checkpoints pre-trained on datasets from multiple domains. • 4 items • Updated Jan 16, 2024 • 3

XLSR

A collection of multilingual Wav2Vec 2.0 checkpoints pre-trained on 53 languages and fine-tuned for CTC speech recognition. • 12 items • Updated Jan 16, 2024 • 6

Wav2Vec 2.0

A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data. • 8 items • Updated Jan 16, 2024 • 18