grimjim posted an update 5 days ago
I'm (finally) releasing a Python script that trims excess weights in Gemma2 full-weight models that were bloated by ~1B parameters due to an early mergekit bug.
https://github.com/jim-plus/Gemma2-mergekit-remediation

I'd noticed something was off when merges of Gemma2 9B models ended up having ~10B parameters. The current mergekit package is fine, but there are still bloated models on HF that could stand to be fixed.
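For context: Gemma2 ties its output head to the input embeddings, so a separate lm_head.weight tensor is redundant. A quick way to check whether a checkpoint is affected (a minimal sketch, assuming sharded safetensors weights with the standard index file, not the released script) is to look for that entry in the weight map:

```python
import json

# A bloated Gemma2 checkpoint carries a redundant lm_head.weight entry;
# a clean one relies on the tied embed_tokens.weight instead.
with open("model.safetensors.index.json") as f:
    index = json.load(f)

if "lm_head.weight" in index["weight_map"]:
    print("Bloated: redundant lm_head.weight present")
else:
    print("Clean: no separate lm_head.weight")
```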

The script assumes it is run from the same directory as the model weights; it trims the unnecessary lm_head.weight tensor and the corresponding index entry.
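The core of the fix looks roughly like the following (a minimal sketch of the approach, not the released script; it assumes sharded safetensors weights and the standard model.safetensors.index.json layout):

```python
import json
from pathlib import Path

from safetensors.torch import load_file, save_file

INDEX_FILE = "model.safetensors.index.json"
TENSOR_NAME = "lm_head.weight"

index_path = Path(INDEX_FILE)
index = json.loads(index_path.read_text())
weight_map = index["weight_map"]

if TENSOR_NAME in weight_map:
    # Drop the tensor from the shard that contains it,
    # then rewrite that shard without it.
    shard_name = weight_map.pop(TENSOR_NAME)
    tensors = load_file(shard_name)
    removed = tensors.pop(TENSOR_NAME)
    save_file(tensors, shard_name, metadata={"format": "pt"})

    # Keep the index's total_size consistent with the trimmed shard.
    index["metadata"]["total_size"] -= removed.numel() * removed.element_size()
    index_path.write_text(json.dumps(index, indent=2))
    print(f"Removed {TENSOR_NAME} from {shard_name}")
else:
    print(f"{TENSOR_NAME} not present; nothing to trim")
```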

Anything to reduce LLM size is great. I can't stand the bloat on my hard drive and GPU.
