mlabonne/TwinLlama-3.1-8B-DPO
Tags: Text Generation, Transformers, Safetensors, English, llama, text-generation-inference, unsloth, trl, dpo, Inference Endpoints
License: apache-2.0
Commit History
Trained with Unsloth (4a76d14, verified): mlabonne committed on Oct 6, 2024
Upload tokenizer (adc761e, verified): mlabonne committed on Oct 6, 2024
Trained with Unsloth (d7cd5a8, verified): mlabonne committed on Sep 7, 2024
Trained with Unsloth (7139153, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (067877b, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (8132c2b, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (24962a1, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (bc61df6, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (35e13fa, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (33f2bf3, verified): mlabonne committed on Sep 6, 2024
Trained with Unsloth (bee6f83, verified): mlabonne committed on Sep 5, 2024
Trained with Unsloth (230c418, verified): mlabonne committed on Sep 5, 2024
Trained with Unsloth (360582c, verified): mlabonne committed on Sep 5, 2024
Trained with Unsloth (49ea886, verified): mlabonne committed on Aug 29, 2024
Upload tokenizer (32b538a, verified): mlabonne committed on Aug 29, 2024
Upload README.md with huggingface_hub (4894472, verified): mlabonne committed on Aug 29, 2024
initial commit (a3f0248, verified): mlabonne committed on Aug 29, 2024