Ali El Filali

alielfilali01

AI & ML interests

AI Psychometrician? | NLP (mainly for Arabic) | Other interests include reinforcement learning and cognitive sciences, among others

Recent Activity

liked a dataset about 14 hours ago

Qwen/CodeElo

upvoted a paper about 17 hours ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

updated a dataset 1 day ago

OALL/requests

Articles

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

Introducing the Open Arabic LLM Leaderboard

alielfilali01's activity

upvoted a paper about 17 hours ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 1 day ago • 30

upvoted a paper 2 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 26 days ago • 72

upvoted a collection 5 days ago

Deepseek Papers

Deepseek papers collection • 14 items • Updated 5 days ago • 9

upvoted 2 papers 5 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 8 days ago • 10

Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Paper • 2412.15255 • Published 19 days ago • 3

upvoted a paper 15 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

upvoted a collection 16 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 16 days ago • 75

upvoted a collection 19 days ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 6 items • Updated 22 days ago • 9

upvoted a paper 19 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 23 days ago • 95

upvoted 3 collections 22 days ago

🧪 FineWeb v1 data experiments

Ablation models trained for our data experiments. • 22 items • Updated Jun 12, 2024 • 4

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 35

AraDICE

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs • 12 items • Updated 22 days ago • 4

upvoted a collection 26 days ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 21 days ago • 122

upvoted 2 articles 26 days ago

Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation

Article • Dec 2, 2024 • 5

Finding Moroccan Arabic (Darija) in Fineweb 2

Article • 26 days ago • 20

upvoted a collection 27 days ago

🥂 FineWeb2

3 items • Updated 27 days ago • 11

upvoted an article 30 days ago

Comparing Open-source and Proprietary LLMs in Medical AI

Article • Oct 3, 2024 • 16

upvoted a paper about 1 month ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 39

upvoted 2 articles about 1 month ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • Nov 21, 2024 • 35

Halo: Open Source Health Tracking with Wearables

Article • Nov 19, 2024 • 99