Ericu950's picture
Upload README.md with huggingface_hub
7542075 verified
|
raw
history blame
1.13 kB
metadata
base_model: []
library_name: transformers
tags:
  - mergekit
  - merge

PapyLlamaMerged

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the TIES merge method using /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct as a base.

Models Merged

The following models were included in the merge:

  • /mimer/NOBACKUP/groups/naiss2024-22-201/PapInsc3/Papyllama2

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
  - model: /mimer/NOBACKUP/groups/naiss2024-22-201/PapInsc3/Papyllama2
    parameters:
      density: 1.1  # Fixed density, slightly more sparse than the original
      weight: 0.6  # Fixed weight to keep the fine-tuned model's influence high
merge_method: ties
base_model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
parameters:
  normalize: true
dtype: bfloat16