Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 20 hours ago

HuggingFaceH4/blogpost-scaling-test-time-compute

new activity 1 day ago

HuggingFaceH4/blogpost-scaling-test-time-compute:Questions about Verifier Development, Search as Data Generation Tool, and Model Family Alignment

liked a model 1 day ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B

View all activity

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model

Faster Assisted Generation with Dynamic Speculation

Llama can now see and run on your device - welcome Llama 3.2

FineVideo: behind the scenes

How NuminaMath Won the 1st AIMO Progress Prize

Welcome Gemma 2 - Google's new open LLM

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Fine-tuning Llama 2 70B using PyTorch FSDP

Code Llama: Llama 2 learns to code

Llama 2 is here - get it on Hugging Face

Can foundation models label data like humans?

The Falcon has landed in the Hugging Face ecosystem

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Red-Teaming Large Language Models

Diffusion Models Live Event

Very Large Language Models and How to Evaluate Them

SetFit: Efficient Few-Shot Learning Without Prompts

Announcing Evaluation on the Hub

Organizations

lewtun's activity

upvoted a paper 2 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 25 days ago • 72

upvoted a paper 3 days ago

Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8

upvoted 2 papers 4 days ago

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Paper • 1610.02424 • Published Oct 7, 2016 • 1

Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 7

upvoted a paper 5 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 8 days ago • 10

upvoted a paper 14 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

upvoted a collection 14 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 15 days ago • 112

upvoted a paper 19 days ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 7

upvoted a paper 20 days ago

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Paper • 2203.11171 • Published Mar 21, 2022 • 3

upvoted a collection 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 197

upvoted 2 papers 2 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 82

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21, 2024 • 59

upvoted a paper 3 months ago

Falcon Mamba: The First Competitive Attention-free 7B Language Model

Paper • 2410.05355 • Published Oct 7, 2024 • 31

upvoted an article 3 months ago

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8, 2024

• 44

upvoted a collection 3 months ago

Critique-out-Loud Reward Models

Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud • 7 items • Updated Sep 5, 2024 • 3

upvoted a paper 3 months ago

Style over Substance: Failure Modes of LLM Judges in Alignment Benchmarking

Paper • 2409.15268 • Published Sep 23, 2024 • 13

upvoted a paper 4 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 124

upvoted an article 5 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19, 2024

• 75

upvoted a paper 5 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13, 2024 • 21

upvoted an article 5 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 56