|
--- |
|
language: |
|
- ko |
|
- en |
|
license: mit |
|
--- |
|
|
|
# Model Card for free-solar-dpo-v0.1 |
|
|
|
## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team |
|
|
|
## Hardware and Software |
|
|
|
* **Training Factors**: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer) |
|
|
|
## Method |
|
- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf). |
|
|
|
## Base Model |
|
- [freewheelin/free-solar-slerp-v0.2](https://huggingface.co/freewheelin/free-solar-slerp-v0.2) |
|
|