Maxime Labonne PRO

mlabonne

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Articles

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit

mlabonne's activity

upvoted 2 articles about 2 months ago

Article

The Beginners Guide to Cleaning a Dataset

Nov 18, 2024

• 24

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

• 98

upvoted an article 2 months ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

• 38

upvoted a paper 2 months ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 12

upvoted an article 3 months ago

Article

VLM Art Analysis

Oct 4, 2024

• 11

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 215

upvoted a collection 5 months ago

🧠 Abliteration

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated Nov 18, 2024 • 24

upvoted an article 5 months ago

Article

Introduction to ggml

Aug 13, 2024

• 125

upvoted a paper 5 months ago

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

Paper • 2408.01050 • Published Aug 2, 2024 • 8

upvoted an article 5 months ago

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

Aug 4, 2024

• 27

upvoted a paper 5 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 23

upvoted a collection 5 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 37

upvoted 2 papers 5 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 30

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18, 2024 • 16

upvoted 3 collections 5 months ago

Bad Data Toolbox

PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18, 2024 • 15

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 121

Finance Commons

A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17, 2024 • 7

upvoted an article 5 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

• 260

upvoted a paper 5 months ago

The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

Paper • 2406.01462 • Published Jun 3, 2024 • 6

upvoted an article 5 months ago

Article

The Rise of Agentic Data Generation

Jul 15, 2024

• 79