Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
akaistormherald
/
ToxicMist-v0.2-7B-DPO-LORA
like
0
Transformers
Safetensors
unalignment/toxic-dpo-v0.2
English
text-generation-inference
unsloth
mistral
trl
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
ToxicMist-v0.2-7B-DPO-LORA
Commit History
Use_rslora causing mergebox issues
9acebf0
verified
akaistormherald
commited on
Mar 10, 2024
Update adapter_config.json
d19a3c5
verified
akaistormherald
commited on
Mar 8, 2024
Update README.md
2a020f0
verified
akaistormherald
commited on
Mar 5, 2024
Update README.md
8037b2d
verified
akaistormherald
commited on
Mar 5, 2024
Update README.md
4eafab6
verified
akaistormherald
commited on
Mar 5, 2024
Trained with Unsloth
7618a95
verified
akaistormherald
commited on
Mar 5, 2024
Trained with Unsloth
a9466e3
verified
akaistormherald
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
d932763
verified
akaistormherald
commited on
Mar 5, 2024
initial commit
b752c0f
verified
akaistormherald
commited on
Mar 5, 2024