metadata
license: apache-2.0
language:
- en
library_name: transformers
tags:
- prune
- notus7b
- Arcee
base_model: argilla/notus-7b-v1
pipeline_tag: text-generation
new_version: AINovice2005/LeEmpereur-final
Model Name:
- LeEmpereur_70
Model Description
The pruning was performed using the PruneMe library from Arcee.ai, significantly reducing the model's size. The exact pruning strategy applied involves reducing the number of parameters by approximately 70%.
Configuration:
The following YAML configuration was used to produce this model:
slices:
- sources:
- model: argilla/notus-7b-v1
layer_range: [0, 1]
- sources:
- model: argilla/notus-7b-v1
layer_range: [2,10]
merge_method: passthrough
dtype: bfloat16
๐๐๐ฌ๐ฎ๐ฅ๐ญ๐ฌ: Firstly, the ideal number of parameters to be pruned should be much lower in future iterations.Secondly, sizeable amount of finetuning should be done if model parameters are reduced to a greater extent.
๐๐จ๐ญ๐: This model is made with the intention to be used for fine-tuning. It should not to be used for inference as is.