---
license: apache-2.0
language:
- fr
- it
- de
- es
- en
tags:
- moe
---
# Model Card for cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO
* An attempt to improve [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) with DPO (Direct Preference Optimization) training
* [DPO Trainer](https://huggingface.co/docs/trl/main/en/dpo_trainer)

Metrics improved after 100 steps of Truthful DPO training:

![Metrics improvement](mixtral-dpo.jpg)
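
For readers unfamiliar with DPO, the per-example objective that this kind of training minimizes can be sketched as below. This is a minimal illustration of the standard DPO loss, not the training script used for this model; the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for the example.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from summed sequence log-probabilities.

    All names and the default beta are illustrative assumptions.
    """
    # How much more the policy likes each response than the frozen reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # loss = -log(sigmoid(logits)), computed in a numerically stable way.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# The policy has shifted toward the chosen answer relative to the reference,
# so the loss falls below log(2), its value when policy and reference agree.
print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

The loss shrinks as the policy assigns relatively more probability to the chosen response and less to the rejected one, measured against the reference model.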