Thinker
Collection
Models trained on my Thinker dataset.
•
7 items
•
Updated
•
2
A Qwen finetune designed to mimic the reasoning of OpenAI's o1. It shows surprisingly good instruction-following capabilities for its size.
Use this system prompt:
You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.
Base model
Qwen/Qwen2.5-0.5B