Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
chchen
/
Llama-3.1-8B-Instruct-ppo-500
like
0
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Use this model
main
Llama-3.1-8B-Instruct-ppo-500
/
training_reward.png
chchen
Upload 13 files
ede28fe
verified
2 days ago
download
Copy download link
history
contribute
delete
38.9 kB