I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token Paper • 2412.06676 • Published 30 days ago • 9
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16-v2-anneal-ablation Text Generation • Updated Aug 23, 2024 • 19
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16-v2 Text Generation • Updated Aug 23, 2024 • 22
konstantindobler/mistral7b-ar-tokenizer-swap-pure-bf16-anneal-ablation Text Generation • Updated Aug 23, 2024 • 22
kd-shared/culturax-ar-spbpe32k-focus-embs-anneal-bf16-mixed-xassy-final Text Generation • Updated Jun 25, 2024 • 19
🏆 Open Arabic LLM Leaderboard Space • Running on CPU Upgrade • 119 • Track, rank and evaluate open Arabic LLMs and chatbots