Sleeping
🥇
Discover amazing AI apps made by the community!
Measuring the gap across models for CoT reasoning in Spanish
Dipromats 2024 Task 2 Leaderboard
Track, rank and evaluate open LLMs and chatbots
Benchmark the ability of LLMs to produce secure code.