sometimesanotion PRO
sometimesanotion
AI & ML interests
Agentic LLM services, model merging, finetunes, distillation
Recent Activity
reacted
to
prithivMLmods's
post
with 🚀
about 17 hours ago
Reasoning SmolLM2 🚀
🎯Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details.
🔥Blog : https://huggingface.co/blog/prithivMLmods/smollm2-ft
🔼 Models :
+ SmolLM2-CoT-360M : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M
+ Reasoning-SmolLM2-135M : https://huggingface.co/prithivMLmods/Reasoning-SmolLM2-135M
+ SmolLM2-CoT-360M-GGUF : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M-GGUF
🤠 Other Details :
+ Demo : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/Demo/SmolLM2%20Demo.ipynb
+ Fine-tune nB : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/finetune/SmolLM-FT.ipynb
reacted
to
prithivMLmods's
post
with 🔥
about 17 hours ago
Reasoning SmolLM2 🚀
🎯Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details.
🔥Blog : https://huggingface.co/blog/prithivMLmods/smollm2-ft
🔼 Models :
+ SmolLM2-CoT-360M : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M
+ Reasoning-SmolLM2-135M : https://huggingface.co/prithivMLmods/Reasoning-SmolLM2-135M
+ SmolLM2-CoT-360M-GGUF : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M-GGUF
🤠 Other Details :
+ Demo : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/Demo/SmolLM2%20Demo.ipynb
+ Fine-tune nB : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/finetune/SmolLM-FT.ipynb
liked
a model
about 18 hours ago
qingy2024/UwU-7B-Instruct
Organizations
sometimesanotion's activity
Thank you for this!
1
#1 opened about 20 hours ago
by
sometimesanotion
Extra SLERP parameters
#1 opened 1 day ago
by
sometimesanotion
This should be an interesting merge
#1 opened 2 days ago
by
sometimesanotion
Thanks for mentioning the individual input models
6
#1 opened 9 days ago
by
CultriX
14B model detected as 7B
7
#1049 opened 15 days ago
by
djuna
What is the prompt template format?
1
#2 opened 8 days ago
by
nyonna
Result PRs not appearing
4
#1054 opened 11 days ago
by
sometimesanotion
Congratulations!
#2 opened 12 days ago
by
sometimesanotion
Interesting methods and results
20
#2 opened about 1 month ago
by
sometimesanotion
Look forward to evaluation
#1 opened 21 days ago
by
sometimesanotion
Based on Qwen 14B, and it LIES constantly about everything China related.
4
#7 opened 23 days ago
by
Maani
Adding Evaluation Results
#1 opened 29 days ago
by
leaderboard-pr-bot
Question about model's origin
2
#2 opened about 1 month ago
by
sometimesanotion
Next version
#1 opened about 1 month ago
by
sometimesanotion
Much appreciated
1
#1 opened about 1 month ago
by
sometimesanotion
Fascinating model, a question
#1 opened about 2 months ago
by
sometimesanotion
Update SuperNova-Medius with a merge with Qwen/Qwen2.5-Coder-14B-Instruct + Further Training 😋
11
#12 opened about 2 months ago
by
Joseph717171
Upload ONNX weights
1
#1 opened 2 months ago
by
Xenova
Multilingual, Uncensored and extensive vocabulary.
5
#4 opened 3 months ago
by
Kukedlc