Cosmo's picture

Cosmo

cosmojg

·

https://cosmo.red

AI & ML interests

Machine learning and computational neuroscience

Recent Activity

liked a model 1 day ago

FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview

upvoted a collection 1 day ago

liked a Space 1 day ago

fishaudio/fish-speech-1

View all activity

Organizations

None yet

cosmojg's activity

upvoted a collection 1 day ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 6 items • Updated about 7 hours ago • 5

upvoted 2 articles 1 day ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

Feb 27, 2024

• 47

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

1 day ago

• 31

upvoted a paper 5 days ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 6 days ago • 44

upvoted an article 7 days ago

Article

Diving into MiniMax01 405B MoE

By

•

7 days ago

• 17

upvoted a paper 7 days ago

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 10

upvoted 2 collections 13 days ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 1 day ago • 28

Cosmos

The collection of Cosmos models • 31 items • Updated 5 days ago • 241

upvoted a collection 16 days ago

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated 17 days ago • 57

upvoted a collection 28 days ago

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated Dec 18, 2024 • 48

upvoted a collection about 1 month ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 88

upvoted 2 collections about 2 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 14 days ago • 87

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 16 days ago • 74

upvoted a collection 2 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 192

upvoted 4 collections 3 months ago

LongVU

7 items • Updated Oct 31, 2024 • 28

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 103

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 1 month ago • 204

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Dec 18, 2024 • 96

upvoted an article 3 months ago

Article

Allegro: Advanced Video Generation Model

By

•

Oct 22, 2024

• 58

upvoted a collection 3 months ago

VPTQ Qwen 2.5 72B Instruct without finetune

arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 72B Instruct without finetune • 8 items • Updated Oct 18, 2024 • 1