jsingh/dpo_rlaif_v0.1
Transformers
Safetensors
Inference Endpoints
arxiv:1910.09700
main
1 contributor
History: 2 commits
Latest commit: jsingh, "Upload MixtralForCausalLM" (7543983, verified), 11 months ago
File                       Scan  Size           Last commit                Updated
.gitattributes             Safe  1.52 kB        initial commit             11 months ago
README.md                  Safe  5.18 kB        Upload MixtralForCausalLM  11 months ago
adapter_config.json        Safe  612 Bytes      Upload MixtralForCausalLM  11 months ago
adapter_model.safetensors  Safe  27.3 MB (LFS)  Upload MixtralForCausalLM  11 months ago
generation_config.json     Safe  111 Bytes      Upload MixtralForCausalLM  11 months ago