ashishsr/rlhf_dxtoicd_reward_adapter
PEFT
Safetensors
arxiv:1910.09700
main
Commit History
Upload tokenizer
875c871
verified
ashishsr committed on Feb 8, 2024
basic training
2a6e01e
verified
ashishsr committed on Feb 8, 2024
initial commit
40f2f97
verified
ashishsr committed on Feb 8, 2024