I'm (finally) releasing a Python script that trims excess weights in Gemma2 full-weight models that were bloated by ~1B parameters due to an early mergekit bug.
https://github.com/jim-plus/Gemma2-mergekit-remediation
I'd noticed something was off when merges of Gemma2 9B models ended up with ~10B parameters. The current mergekit package is fine, but there are still bloated models on HF that could stand to be fixed.
The script assumes it is run from the same directory as the model weights, and it trims the unnecessary lm_head.weight tensor along with the corresponding index entry.
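For anyone curious what that trimming amounts to, here is a minimal sketch (not the released script) assuming the standard Hugging Face sharded safetensors layout, i.e. a model.safetensors.index.json next to the shards; since Gemma2 ties lm_head to the input embeddings, the extra tensor can simply be dropped:

```python
import json
from pathlib import Path

from safetensors.torch import load_file, save_file

# Assumption: standard HF sharded checkpoint in the current directory.
index_path = Path("model.safetensors.index.json")
index = json.loads(index_path.read_text())
weight_map = index["weight_map"]

# Remove the index entry and find which shard holds the redundant tensor.
shard_name = weight_map.pop("lm_head.weight", None)
if shard_name is None:
    print("No lm_head.weight entry found; nothing to trim.")
else:
    tensors = load_file(shard_name)
    dropped = tensors.pop("lm_head.weight", None)
    if dropped is not None:
        # Rewrite the shard without the tied tensor and adjust the total size.
        save_file(tensors, shard_name, metadata={"format": "pt"})
        index["metadata"]["total_size"] -= dropped.numel() * dropped.element_size()
    index_path.write_text(json.dumps(index, indent=2))
    print(f"Trimmed lm_head.weight from {shard_name}.")
```

See the repo above for the actual remediation script.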