Jeremie Tisby

Frobenius

AI & ML interests

None yet

Recent Activity

liked a model about 7 hours ago

tensorblock/Sky-T1-32B-Preview-GGUF

liked a model 5 days ago

omkarthawakar/LlamaV-o1

liked a model about 1 month ago

openai/whisper-large-v3-turbo

View all activity

Organizations

Frobenius's activity

liked a model about 7 hours ago

tensorblock/Sky-T1-32B-Preview-GGUF

Updated 9 days ago • 503 • 2

liked a model 5 days ago

omkarthawakar/LlamaV-o1

Question Answering • Updated 8 days ago • 2.58k • 76

liked 3 models about 1 month ago

liked a Space about 1 month ago

Running

477

📈

Scaling test-time compute

replied to lewtun's post about 1 month ago

Wow people... This is CRACKED! THANK YOU HF!!!

reacted to lewtun's post with 🔥 about 1 month ago

Post

6751

We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!