
Serge Brun

surfiend

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago
ggml-org/gguf-my-repo
liked a model about 1 month ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Organizations

None yet

surfiend's activity

upvoted 2 articles 5 months ago
Fine-tuning LLMs to 1.58bit: extreme quantization made easy

reacted to osanseviero's post with 👍 about 1 year ago
Mixture of experts: beware 🛡️⚔️

New paper by DeepMind: Buffer Overflow in Mixture of Experts (2402.05526)

The paper shows an adversarial attack strategy in which a user sends malicious queries that can affect the output of other user queries from the same batch.

So if the same batch contains:
- a benign query from user A
- a malicious query from user B
the response for A might be altered! 😱

How is this possible?
One approach is to fill the token buffers with adversarial data, forcing the gating network to route tokens to non-ideal experts or to drop the benign tokens entirely (when buffer capacity is finite).
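To make the buffer-filling idea concrete, here is a toy sketch (not the paper's setup: the routing rule, expert count, capacity, and logits are all made up for illustration). With capacity-limited top-1 routing, an attacker whose tokens precede the victim's in the batch can fill the buffer of the expert the victim's token needs:

```python
import numpy as np

def route_with_capacity(gate_logits, capacity):
    """Toy top-1 routing with a fixed per-expert buffer.

    Tokens are processed in batch order; once an expert's buffer is
    full, later tokens routed to it are simply dropped (a deliberate
    simplification of capacity-limited MoE routing).
    """
    n_tokens, n_experts = gate_logits.shape
    counts = [0] * n_experts
    assignment = []  # expert index per token, or None if dropped
    for t in range(n_tokens):
        e = int(np.argmax(gate_logits[t]))
        if counts[e] < capacity:
            counts[e] += 1
            assignment.append(e)
        else:
            assignment.append(None)  # buffer full: token dropped
    return assignment

capacity = 2

# One benign token from user A that strongly prefers expert 0.
benign = np.array([[5.0, 0.0, 0.0, 0.0]])

# User B floods the batch with tokens that also target expert 0
# and happen to sit earlier in the batch.
malicious = np.tile([[6.0, 0.0, 0.0, 0.0]], (capacity, 1))

batch = np.vstack([malicious, benign])
assignment = route_with_capacity(batch, capacity)
print(assignment)  # [0, 0, None] -> the benign token is dropped
```

Remove the malicious rows and the benign token lands on expert 0 as intended, which is exactly the batch-dependence the attack exploits.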

This assumes the adversary can use the model as a black box, observe the output logits, and ensure their data is always grouped into the same batch as the victim's.

How to mitigate this?
- Randomize batch order (and even run twice if some queries are very sensitive)
- Use a large capacity slack
- Sample from the gate weights instead of taking the top-k (not great IMO, as it requires more memory at inference)
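The last mitigation can be sketched as follows (again a toy illustration, not the paper's implementation; the logits are made up). Sampling from the softmax of the gate logits means an attacker can no longer deterministically steer every adversarial token into one expert's buffer:

```python
import numpy as np

def sample_gate(gate_logits, rng):
    """Pick an expert by sampling from softmax(gate logits) instead
    of argmax, so identical adversarial tokens spread across experts
    rather than all landing in one buffer."""
    probs = np.exp(gate_logits - gate_logits.max())
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

rng = np.random.default_rng(0)
logits = np.array([2.0, 0.0, 0.0, 0.0])  # attacker targets expert 0

picks = [sample_gate(logits, rng) for _ in range(1000)]
# With argmax routing, 100% of these tokens would hit expert 0;
# with sampling, a noticeable fraction spills to other experts.
print(picks.count(0) / 1000)
```

The trade-off the post mentions is real: stochastic routing changes which experts must be resident for a given batch, which tends to cost extra memory at inference.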

Very cool paper!!