BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 43
Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages May 24, 2024 • 25
CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models May 24, 2024 • 21
The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19, 2024 • 126
Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Apr 16, 2024 • 14
Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes? Mar 5, 2024 • 4
Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem Feb 20, 2024 • 3
NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates Feb 2, 2024 • 3
Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases Jan 31, 2024 • 3
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29, 2024 • 17
A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard Jan 12, 2024 • 6
Open Source AI Year in Review 2024: What happened in open-source AI this year, and what's next?