Yassine Ennaour

Lyte

Lyte's activity

updated a model about 3 hours ago

Lyte/Titans-MAC-test-bad-run-with-bug

Text Generation • Updated about 3 hours ago

liked a model about 10 hours ago

deepseek-ai/DeepSeek-R1

Updated about 3 hours ago • 879

published a model about 10 hours ago

Lyte/Titans-MAC-test-bad-run-with-bug

Text Generation • Updated about 3 hours ago

liked a Space about 14 hours ago

Running

🏢

Pdfitdown

Convert (almost) everything to PDF!

liked a model about 15 hours ago

bartowski/DeepSeek-R1-Distill-Qwen-1.5B-GGUF

Text Generation • Updated about 15 hours ago • 12

liked 3 models about 17 hours ago

liked a Space 2 days ago

Running on Zero

1.27k

❤️

Kokoro TTS

Now in 5 languages!

liked a dataset 3 days ago

HuggingFaceFW/fineweb-2

Viewer • Updated 12 days ago • 12.5B • 56.9k • 393

liked 3 models 7 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated about 1 hour ago • 18.3k • 695

Qwen/Qwen2.5-Math-7B-PRM800K

Text Classification • Updated 4 days ago • 161 • 9

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated 4 days ago • 2.85k • 44

upvoted an article 9 days ago

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

Article • 11 days ago • 20

reacted to hexgrad's post with 🚀🔥 12 days ago

Post

16088

📣 Looking for labeled, high-quality synthetic audio/TTS data 📣 Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❤️

More details at hexgrad/Kokoro-82M#21

20 replies

upvoted a paper 12 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 13 days ago • 237

liked a model 13 days ago

ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • Updated 8 days ago • 4.53k • 36

reacted to alielfilali01's post with 👍 13 days ago

Post

1809

The 3C3H AraGen Leaderboard today welcomes deepseek-ai/DeepSeek-V3 and 12 other models (including the late gpt-3.5 💀) to the ranking of the best LLMs in Arabic!

Observations:
- DeepSeek-V3 ranked 3rd and is the only open model among the top 5!

- A 14B open model (Qwen/Qwen2.5-14B-Instruct) outperforms gpt-3.5-turbo-0125 (from last year). This shows how far we have come in advancing and supporting Arabic presence within the LLM ecosystem!

- Contrary to what is observed in likelihood-acc leaderboards (like OALL/Open-Arabic-LLM-Leaderboard), further finetuned models like maldv/Qwentile2.5-32B-Instruct actually decreased in performance compared to the original model Qwen/Qwen2.5-32B-Instruct.
It's worth noting that the decrease is statistically insignificant, which implies that, at best, out-of-domain finetuning does not really hurt the model's original capabilities acquired during pretraining.
Previous work addressed this (finetuning vs. pretraining), but more investigation is required (any PhDs here? This could be your question ...)

Check out the latest rankings: inceptionai/AraGen-Leaderboard

liked a model 13 days ago

Lightricks/LTX-Video

Image-to-Video • Updated Dec 19, 2024 • 97.9k • 876