Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 64 items • Updated 30 minutes ago • 495
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 1 day ago • 30
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 6 days ago • 10
Tooka Collection This collection hosts the transformers and original repos of the Tooka releases. • 3 items • Updated 28 days ago • 1
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models Paper • 2406.05223 • Published Jun 7, 2024 • 4
Mirror Collection Mirror: A Universal Framework for Various Information Extraction Tasks https://arxiv.org/abs/2311.05419 • 5 items • Updated Oct 11, 2024 • 1
Autoregressive Video Generation without Vector Quantization Paper • 2412.14169 • Published 16 days ago • 14
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 24 days ago • 46
xFinder: Robust and Pinpoint Answer Extraction for Large Language Models Paper • 2405.11874 • Published May 20, 2024 • 7
Grimoire is All You Need for Enhancing Large Language Models Paper • 2401.03385 • Published Jan 7, 2024 • 5
xFinder Collection The official collection for "xFinder: Robust and Pinpoint Answer Extraction for Large Language Models". • 4 items • Updated Nov 7, 2024 • 4
Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception Paper • 2410.12788 • Published Oct 16, 2024 • 24
BiMediX2 Collection BiMediX2 : Bio-Medical EXpert LMM for Diverse Medical Modalities • 5 items • Updated 17 days ago • 6
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 6 items • Updated Nov 7, 2024 • 4
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models Paper • 2310.06762 • Published Oct 10, 2023 • 2
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling Paper • 2411.00750 • Published Nov 1, 2024 • 1
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published 17 days ago • 41
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models Paper • 2407.21077 • Published Jul 29, 2024 • 1