Younes Belkada

ybelkada

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Articles

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 103

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 22

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 35

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 66

Organizations

Posts 4

Post

2912

Falcon Mamba is now available in llama.cpp!
Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a

Post

3720

FalconMamba 7B, a new model from TII (Technology Innovation Institute), is out!

- Blogpost: https://huggingface.co/blog/falconmamba
- Link to collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Link to playground: tiiuae/falcon-mamba-playground

Collections 1

Papers 8

Spaces 25

🦙 GGUF My Repo • Sleeping

👀 Test Zero • No application file

🐠 Dlai Test 2 • Sleeping

🚀 Blip Imagecaptioning Dlai • No application file

⚡ Open Source List Models • Running

🌖 Llava 1.5 Dlai • Runtime error

Models 143

ybelkada/tiny-random-T5ForConditionalGeneration-calibrated

Text2Text Generation • Updated 23 days ago • 706k

ybelkada/t5-11b-sharded

Translation • Updated Nov 21, 2024 • 17 • 1

ybelkada/mpt-7b-bf16-sharded

Text Generation • Updated Nov 17, 2024 • 21

ybelkada/gpt-j-6b-sharded-bf16

Text Generation • Updated Nov 10, 2024 • 430 • 2

ybelkada/t5-3b-sharded

Text2Text Generation • Updated Oct 26, 2024 • 52 • 1

ybelkada/test-gguf-trainer-Q8_0-GGUF

Updated May 28, 2024 • 6

ybelkada/test-gguf-trainer

Text Generation • Updated May 28, 2024 • 32 • 1

ybelkada/tiny-random-llama-Q6_K-GGUF

Updated May 28, 2024 • 6

ybelkada/test-gguf-trainer-Q4_K_M-GGUF

Updated May 27, 2024 • 1

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22, 2024 • 3

Datasets 12

ybelkada/model_cards_correct_tag

Viewer • Updated Mar 19, 2024 • 54 • 42

ybelkada/model-info-library-name

Updated Jan 23, 2024 • 15

ybelkada/test-model-info-library-name

Viewer • Updated Jan 23, 2024 • 1 • 39

ybelkada/documentation-images

Viewer • Updated Jan 19, 2024 • 2 • 32k

ybelkada/oasst1-tiny-subset

Viewer • Updated May 11, 2023 • 44.1k • 42 • 2

ybelkada/oasst1

Viewer • Updated May 11, 2023 • 44.1k • 46 • 1

ybelkada/food101-tiny

Viewer • Updated May 5, 2023 • 100 • 38

ybelkada/test-onepiece-dataset

Viewer • Updated May 5, 2023 • 10 • 39

ybelkada/common_voice_mr_11_0_copy

Viewer • Updated Apr 4, 2023 • 10.8k • 200

ybelkada/english_quotes_copy

Viewer • Updated Apr 4, 2023 • 2.51k • 4.32k

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem