Chong Ruan

Chester111

AI & ML interests

AGI & LLM

Chester111's activity

authored a paper 17 days ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published 21 days ago • 11

updated a collection 17 days ago

DeepSeek-VL2

Collection

4 items • Updated 17 days ago • 34

New activity in deepseek-ai/DeepSeek-Prover-V1.5-RL 19 days ago

236B?

#9 opened 21 days ago by erichartford

New activity in deepseek-ai/DeepSeek-V2.5-1210 24 days ago

Adds `transformers` as a library

#1 opened 24 days ago by reach-vb

authored a paper about 2 months ago

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 27

authored a paper 3 months ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 32

New activity in deepseek-ai/DeepSeek-Coder-V2-Instruct-0724 3 months ago

Update README.md

#5 opened 3 months ago by xianbao

updated 2 models 3 months ago

deepseek-ai/DeepSeek-Coder-V2-Instruct-0724

Text Generation • Updated Oct 8, 2024 • 637 • 97

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated 24 days ago • 11.3k • 682

New activity in deepseek-ai/DeepSeek-V2.5 3 months ago

Update metadata

#9 opened 4 months ago by xianbao

New activity in deepseek-ai/DeepSeek-Coder-V2-Instruct 5 months ago

Add base_model metadata

#8 opened 5 months ago by davanstrien

updated a model 5 months ago

deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • Updated Aug 21, 2024 • 153k • 520

New activity in deepseek-ai/DeepSeek-Prover-V1.5-SFT 5 months ago

Add base_model metadata

#2 opened 5 months ago by davanstrien

New activity in deepseek-ai/DeepSeek-Prover-V1.5-RL 5 months ago

Add base_model metadata

#3 opened 5 months ago by davanstrien

authored 6 papers 5 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 52

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 58

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 40

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 44

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 41