Running 49 π Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
Running on CPU Upgrade 77 π Open LLM Leaderboard Model Comparator Compare Open LLM Leaderboard results