library_name: transformers | |
tags: [] | |
An experimental coding instruct model. This is a full finetune of DeepSeek-Coder-Instruct-1.3B for 15 hours on 1xA6000 using a bespoke distillation trainer. | |
library_name: transformers | |
tags: [] | |
An experimental coding instruct model. This is a full finetune of DeepSeek-Coder-Instruct-1.3B for 15 hours on 1xA6000 using a bespoke distillation trainer. | |