Running
on
CPU Upgrade
12.1k
🏆
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Discover amazing AI apps made by the community!
Track, rank and evaluate open LLMs and chatbots
VLMEvalKit Evaluation Results Collection
AI Phone Leaderboard
A benchmark for open-source multi-dialect Arabic ASR models
Korean Leaderboard
Track, rank and evaluate open LLMs' CoT quality
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Compare Open LLM Leaderboard results
Persian Text Embedding Benchmark