library_name: transformers | |
license: other | |
license_name: qwen-research | |
license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE | |
base_model: Qwen/Qwen2.5-3B | |
tags: | |
- generated_from_trainer | |
model-index: | |
- name: outputs/gelato-3b | |
results: [] | |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You | |
should probably proofread and complete it, then remove this comment. --> | |
Prompt Format: **ChatML** | |
Trained Datasets: | |
- [arcee-ai/EvolKit-20k](https://huggingface.co/datasets/arcee-ai/EvolKit-20k) | |
- [LDJnr/Capybara](https://huggingface.co/datasets/LDJnr/Capybara) | |
- and a private dataset | |
GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF | |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/630430583926de1f7ec62c6b/L45Szb9WeV-K_bxS8aFoj.png) | |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/630430583926de1f7ec62c6b/GQtNdAaoXZXwf4noU883B.png) | |