๐ช SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos โข 12 items โข Updated Dec 22, 2024 โข 209
OpenCulture Collection A multilingual dataset of public domain books and newspapers. โข 27 items โข Updated Nov 6, 2024 โข 123