YangWang92

yangwang92

AI & ML interests

None yet

yangwang92's activity

liked a model 2 days ago

OpenGVLab/InternVL2-Llama3-76B

Image-Text-to-Text • Updated 14 days ago • 46.4k • 210

liked a model 4 days ago

pentagoniac/SEMIKONG-70B

Text Generation • Updated Jul 13, 2024 • 3.76k • 20

upvoted a collection 5 days ago

DeepSeek-V3 BF16

Collection

2 items • Updated 6 days ago • 3

liked a model 6 days ago

deepseek-ai/DeepSeek-V3

Updated 2 days ago • 30.9k • 825

upvoted a paper 6 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 9 days ago • 37

liked a model 6 days ago

deepseek-ai/DeepSeek-V3-Base

Updated 2 days ago • 5.64k • 1.02k

liked a Space 7 days ago

🌍 QVQ 72B Preview

Running • 338

liked 2 models 8 days ago

facebook/SONAR

Updated Feb 14, 2024 • 38

OpenGVLab/InternVL2_5-78B-MPO

Image-Text-to-Text • Updated 10 days ago • 406 • 23

upvoted 2 papers 8 days ago

OpenAI o1 System Card

Paper • 2412.16720 • Published 11 days ago • 27

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 68

upvoted a collection 8 days ago

InternVL2.5-MPO

Collection

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated about 23 hours ago • 23

liked a Space 9 days ago

📈 Scaling test-time compute

Running • 410

upvoted a collection 10 days ago

long-cot-dataset

Collection

16 items • Updated 10 days ago • 3

upvoted a paper 11 days ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 12

liked a model 12 days ago

Qwen/Qwen2.5-Math-RM-72B

Text Classification • Updated Oct 31, 2024 • 10.3k • 66

upvoted a paper 12 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 13 days ago • 333

upvoted a paper 14 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published 19 days ago • 79

upvoted a paper 15 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 53

liked a dataset 15 days ago

meta-llama/Llama-3.2-1B-Instruct-evals

Viewer • Updated Sep 25, 2024 • 142k • 368 • 18