Unchun Yang

ucyang

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

upvoted an article about 4 hours ago

🌁#83: GAN is back

liked a model about 8 hours ago

Cohere/rerank-v3.5

liked a dataset about 8 hours ago

CohereForAI/Global-MMLU

ucyang's activity

upvoted an article about 4 hours ago

🌁#83: GAN is back

Article • 6 days ago • 7

upvoted 5 collections about 9 hours ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 6 items • Updated Dec 13, 2024 • 10

Aya Datasets

The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated Dec 3, 2024 • 15

C4AI Command R

C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 4 items • Updated Dec 3, 2024 • 21

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated Dec 16, 2024 • 31

Command Models

Latest C4AI Command models • 4 items • Updated 1 day ago • 5

upvoted a collection about 23 hours ago

Phi-4

Phi-4 small language model. • 2 items • Updated 11 days ago • 42

upvoted a paper 3 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 5 days ago • 259

upvoted a collection 5 days ago

MiniCPM

The MiniCPM family of LLMs and VLLMs. • 32 items • Updated about 7 hours ago • 59

upvoted a paper 6 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 11 days ago • 77

upvoted a paper 7 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 57

upvoted an article 8 days ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 76

upvoted a paper 9 days ago

Quantifying the Carbon Emissions of Machine Learning

Paper • 1910.09700 • Published Oct 21, 2019 • 13

upvoted a paper 10 days ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published 17 days ago • 11

upvoted a collection 10 days ago

KaLM-embedding

6 items • Updated 4 days ago • 21

upvoted a collection 12 days ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 7 days ago • 24

upvoted a paper 15 days ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 74

upvoted a paper 16 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 78

upvoted a paper 18 days ago

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 19

upvoted a paper 19 days ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 44