license: mit language: - en tags: - text generation datasets: - fhswf/TinyStoriesV2_cleaned
Based on get-neo BPE Tokenizer, but with a smaller vocabulary. Trained with TinyStoriesV2.