Update README.md
README.md
CHANGED
@@ -28,7 +28,7 @@ base_model:
 
 QwQ-0.5B-Distilled was trained using the **QwQ-LongCoT-130K dataset**, a carefully curated collection of long-context examples designed for reasoning and conversational AI tasks. The GKD framework ensures that the student model mimics the teacher model's outputs, aligning its predictions with high-quality responses.
 ### Training Progress:
-[
+[██░░░░░░░░] 23%
 
 ### Training Script:
 
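For context on the GKD setup described in the changed paragraph, below is a minimal, hedged sketch of how such a distillation run can be wired up with TRL's `GKDTrainer`. It is not the card's actual training script: the student and teacher IDs (`Qwen/Qwen2.5-0.5B-Instruct`, `Qwen/QwQ-32B-Preview`), the dataset ID (`amphora/QwQ-LongCoT-130K`), the data formatting, and all hyperparameters are illustrative assumptions.

```python
# Hedged sketch, NOT the card's actual training script. It assumes the distillation
# uses TRL's GKDTrainer (an implementation of Generalized Knowledge Distillation);
# model IDs, dataset handling, and hyperparameters below are illustrative only.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import GKDConfig, GKDTrainer

STUDENT_ID = "Qwen/Qwen2.5-0.5B-Instruct"  # assumed student base model
TEACHER_ID = "Qwen/QwQ-32B-Preview"        # assumed QwQ teacher model

tokenizer = AutoTokenizer.from_pretrained(STUDENT_ID)
student = AutoModelForCausalLM.from_pretrained(STUDENT_ID)
teacher = AutoModelForCausalLM.from_pretrained(TEACHER_ID, torch_dtype="bfloat16")

# Assumed dataset ID; examples are expected to already be in the conversational
# "messages" format that GKDTrainer consumes (convert your columns if they are not).
train_dataset = load_dataset("amphora/QwQ-LongCoT-130K", split="train")

args = GKDConfig(
    output_dir="qwq-0.5b-distilled",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    max_new_tokens=256,  # length of on-policy generations sampled during training
    lmbda=0.5,           # fraction of batches built from student-generated (on-policy) data
    beta=0.5,            # interpolation coefficient of the generalized JSD loss
)

trainer = GKDTrainer(
    model=student,
    teacher_model=teacher,
    args=args,
    processing_class=tokenizer,
    train_dataset=train_dataset,
)
trainer.train()
```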