Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 491
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 123
Tulu V2.5 Suite Collection A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more. • 44 items • Updated 26 days ago • 15
[lecture artifacts] aligning open language models Collection Artifacts referenced in the talk timeline. Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 56
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies Paper • 2404.08197 • Published Apr 12, 2024 • 28