YuLan-Mini - a yulan-team Collection

yulan-team 's Collections

YuLan-Mini

updated 6 days ago

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.

yulan-team/YuLan-Mini

Text Generation • Updated about 15 hours ago • 566 • 26

Note A highly capable 2.4B lightweight LLM using only 1T pre-training data.
yulan-team/YuLan-Mini-Datasets

Updated 6 days ago • 261 • 8
YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 11 days ago • 59
yulan-team/YuLan-Mini-Before-Annealing

Updated 5 days ago • 34 • 5

Note The model & optimizer states of the last curriculum phase before learning rate annealing.
yulan-team/YuLan-Mini-Phase20

Updated 6 days ago • 8 • 2

Note The model & optimizer states of the 20th curriculum phase.