1 33 88

momo

wzc991222

AI & ML interests

None yet

Recent Activity

liked a model about 24 hours ago

deepseek-ai/DeepSeek-R1

liked a model 7 days ago

hexgrad/Kokoro-82M

liked a model 12 days ago

microsoft/phi-4

View all activity

Organizations

wzc991222's activity

liked a model about 24 hours ago

deepseek-ai/DeepSeek-R1

Updated about 2 hours ago • 859

liked a model 7 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 3 days ago • 27.4k • 2.12k

liked a model 12 days ago

microsoft/phi-4

Text Generation • Updated 12 days ago • 134k • 1.48k

commented a paper 18 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 19 days ago • 97 •

upvoted a paper 22 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 25 days ago • 25

upvoted a collection 22 days ago

Deepseek Papers

Collection

Deepseek papers collection • 14 items • Updated 22 days ago • 10

upvoted a paper 25 days ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 29 days ago • 64

liked a model 25 days ago

deepseek-ai/DeepSeek-V3

Updated 22 days ago • 162k • 2.1k

liked a model 26 days ago

deepseek-ai/DeepSeek-V3-Base

Updated 22 days ago • 17.4k • 1.29k

liked a model about 1 month ago

deepseek-ai/deepseek-vl2

Image-Text-to-Text • Updated Dec 18, 2024 • 2.35k • 135

liked a Space about 1 month ago

Running

477

📈

Scaling test-time compute

liked a model about 1 month ago

rhysjones/gpt2-124M-edu-fineweb-10B

Text Generation • Updated Jun 19, 2024 • 453 • 6

upvoted a paper about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 104

liked a model about 1 month ago

recursal/QRWKV6-32B-Instruct-Preview-v0.1

Text Generation • Updated 29 days ago • 667 • 65

liked a Space about 1 month ago

Running

🔥

OPEN-MOE-LLM-LEADERBOARD

upvoted a paper about 1 month ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 74

liked a Space about 1 month ago

Running on Zero

3.17k

🏢

TRELLIS

Scalable and Versatile 3D Generation from images

upvoted a paper about 1 month ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 55

liked a model about 2 months ago

HuggingFaceTB/SmolLM2-135M

Text Generation • Updated Nov 23, 2024 • 153k • 45

upvoted a paper about 2 months ago

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43