20 2 66

sometimesanotion PRO

sometimesanotion

AI & ML interests

Agentic LLM services, model merging, finetunes, distillation

Recent Activity

reacted to prithivMLmods's post with 🚀 about 17 hours ago

Reasoning SmolLM2 🚀 🎯Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details. 🔥Blog : https://huggingface.co/blog/prithivMLmods/smollm2-ft 🔼 Models : + SmolLM2-CoT-360M : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M + Reasoning-SmolLM2-135M : https://huggingface.co/prithivMLmods/Reasoning-SmolLM2-135M + SmolLM2-CoT-360M-GGUF : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M-GGUF 🤠 Other Details : + Demo : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/Demo/SmolLM2%20Demo.ipynb + Fine-tune nB : https://huggingface.co/prithivMLmods/SmolLM2-CoT-360M/blob/main/finetune/SmolLM-FT.ipynb

reacted to prithivMLmods's post with 🔥 about 17 hours ago

liked a model about 18 hours ago

qingy2024/UwU-7B-Instruct

View all activity

Organizations

sometimesanotion's activity

New activity in mradermacher/Lamarck-14B-v0.6-GGUF about 20 hours ago

Thank you for this!

#1 opened about 20 hours ago by

sometimesanotion

New activity in bamec66557/Qwen-2.5-14B-MINUS 1 day ago

Extra SLERP parameters

#1 opened 1 day ago by

sometimesanotion

New activity in hotmailuser/QwenSlerp2-14B 1 day ago

This should be an interesting merge

#1 opened 2 days ago by

sometimesanotion

New activity in sometimesanotion/Qwen2.5-14B-Vimarckoso-v3 7 days ago

Thanks for mentioning the individual input models

#1 opened 9 days ago by

CultriX

New activity in open-llm-leaderboard/open_llm_leaderboard 8 days ago

14B model detected as 7B

#1049 opened 15 days ago by

djuna

New activity in sometimesanotion/Qwen2.5-14B-Vimarckoso-v3 8 days ago

What is the prompt template format?

#2 opened 8 days ago by

nyonna

New activity in open-llm-leaderboard/open_llm_leaderboard 11 days ago

Result PRs not appearing

#1054 opened 11 days ago by

sometimesanotion

New activity in sthenno-com/miscii-14b-1225 12 days ago

Congratulations!

#2 opened 12 days ago by

sometimesanotion

New activity in CultriX/SeQwence-14Bv3 19 days ago

Interesting methods and results

#2 opened about 1 month ago by

sometimesanotion

New activity in Aashraf995/QwenStock-14B 21 days ago

Look forward to evaluation

#1 opened 21 days ago by

sometimesanotion

New activity in arcee-ai/Virtuoso-Small 22 days ago

Based on Qwen 14B, and it LIES constantly about everything China related.

#7 opened 23 days ago by

Maani

New activity in sometimesanotion/Lamarck-14B-v0.3 29 days ago

Adding Evaluation Results

#1 opened 29 days ago by

leaderboard-pr-bot

New activity in arcee-ai/Virtuoso-Small about 1 month ago

Question about model's origin

#2 opened about 1 month ago by

sometimesanotion

New activity in sometimesanotion/KytheraMix-7B-v0.2 about 1 month ago

Next version

#1 opened about 1 month ago by

sometimesanotion

New activity in mradermacher/AgoraMix-14B-stock-v0.1-i1-GGUF about 1 month ago

Much appreciated

#1 opened about 1 month ago by

sometimesanotion

New activity in edgerunner-ai/EdgeRunner-Command-Nested about 2 months ago

Fascinating model, a question

#1 opened about 2 months ago by

sometimesanotion

New activity in anthracite-org/magnum-v4-12b about 2 months ago

Request

#5 opened about 2 months ago by

isr431

New activity in arcee-ai/SuperNova-Medius about 2 months ago

Update SuperNova-Medius with a merge with Qwen/Qwen2.5-Coder-14B-Instruct + Further Training 😋

#12 opened about 2 months ago by

Joseph717171

New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 2 months ago

Upload ONNX weights

#1 opened 2 months ago by

Xenova

New activity in arcee-ai/SuperNova-Medius 2 months ago

Multilingual, Uncensored and extensive vocabulary.

#4 opened 3 months ago by

Kukedlc