rAIfle/SorcererLM-8x22b-bf16
Tags: Safetensors, mixtral
License: apache-2.0
refs/pr/1: SorcererLM-8x22b-bf16/train/sorc_ds.json
rAIfle: Upload 2 files (commit 4fa1155, verified, 5 months ago)
138 Bytes
{
  "train_micro_batch_size_per_gpu": 1,
  "gradient_accumulation_steps": 2,
  "gradient_clipping": 1.0,
  "steps_per_print": 1
}
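
The key names above match DeepSpeed's configuration schema, so this file is presumably a DeepSpeed training config (the page itself does not say so). Under that assumption, the number of samples per optimizer step is the per-GPU micro batch size times the gradient accumulation steps times the number of GPUs. A minimal Python sketch of that arithmetic follows; the helper name `effective_batch_size` and the GPU count of 8 are hypothetical and do not come from the source.

```python
import json

# Config text copied from sorc_ds.json as shown above.
CONFIG_TEXT = """
{
  "train_micro_batch_size_per_gpu": 1,
  "gradient_accumulation_steps": 2,
  "gradient_clipping": 1.0,
  "steps_per_print": 1
}
"""


def effective_batch_size(config: dict, world_size: int) -> int:
    """Samples per optimizer step: micro batch * accumulation steps * GPUs.

    Hypothetical helper for illustration; not part of the repository.
    """
    return (
        config["train_micro_batch_size_per_gpu"]
        * config["gradient_accumulation_steps"]
        * world_size
    )


config = json.loads(CONFIG_TEXT)

# world_size=8 is an assumed GPU count; the source does not state the hardware.
print(effective_batch_size(config, world_size=8))  # 1 * 2 * 8 = 16
```

With a micro batch size of 1 and 2 accumulation steps, each GPU contributes 2 samples per optimizer step, so the global batch scales directly with however many GPUs were actually used.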