Valeriy Selitskiy

WaveCut

AI & ML interests

Looking to switch from hobby to career

Recent Activity

liked a model about 11 hours ago
ohayonguy/PMRF_blind_face_image_restoration
liked a Space about 16 hours ago
FaceOnLive/Face-Search-Online
liked a model 1 day ago
stabilityai/stable-fast-3d

Organizations

Vikhr models, MLX Community, AI Art Collaboration space

WaveCut's activity

reacted to grimjim's post with 👍 2 days ago
I'm (finally) releasing a Python script that trims excess weights in Gemma2 full-weight models that were bloated by ~1B parameters due to an early mergekit bug.
https://github.com/jim-plus/Gemma2-mergekit-remediation

I'd noticed something was off when merges of Gemma2 9B models ended up having ~10B parameters. The current mergekit package is fine, but there are still bloated models on HF that could stand to be fixed.

The script assumes that it will be run from the same directory as the model weights, and will trim the unnecessary lm_head.weight tensor and corresponding index entry.
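
To illustrate the "corresponding index entry" part, here is a minimal sketch of dropping a tensor's entry from a sharded-checkpoint index. It is not the script from the repository above. It assumes the usual Hugging Face index layout (a `model.safetensors.index.json` file with a `weight_map` object mapping tensor names to shard files); the helper names `drop_index_entry` and `trim_index_file` are hypothetical. It does not rewrite the shard itself or adjust `metadata.total_size`; removing the tensor data would be a separate step.

```python
import json
from pathlib import Path
from typing import Optional, Tuple

# Tensor named in the post; everything else here is an illustrative assumption.
TENSOR_NAME = "lm_head.weight"


def drop_index_entry(index: dict, tensor_name: str = TENSOR_NAME) -> Tuple[dict, Optional[str]]:
    """Return a copy of the index without `tensor_name`, plus the shard that held it.

    The shard name is None when the tensor was not listed, so the caller can
    tell whether anything changed. The input dict is left unmodified.
    """
    weight_map = dict(index.get("weight_map", {}))
    shard = weight_map.pop(tensor_name, None)
    trimmed = {**index, "weight_map": weight_map}
    return trimmed, shard


def trim_index_file(path: Path, tensor_name: str = TENSOR_NAME) -> Optional[str]:
    """Rewrite the index file in place only if the entry was present."""
    index = json.loads(path.read_text(encoding="utf-8"))
    trimmed, shard = drop_index_entry(index, tensor_name)
    if shard is not None:
        path.write_text(json.dumps(trimmed, indent=2), encoding="utf-8")
    return shard
```

Run from the model directory, `trim_index_file(Path("model.safetensors.index.json"))` would return the name of the shard that still contains the tensor data, which is the file a follow-up step would need to rewrite.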