rho-math-7b-v0.1 / README.md
zhibin-msft's picture
Upload folder using huggingface_hub
c9a5da1 verified
|
raw
history blame
468 Bytes
---
license: mit
tags:
- nlp
- math
language:
- en
pipeline_tag: text-generation
---
## Rho-1: Not All Tokens Are What You Need
The Rho-1 series are pretrained language models that utilize Selective Language Modeling (SLM) objectives.
In math reasoning pretraining, SLM improves average few-shot accuracy on GSM8k and MATH by over 16%, achieving the baseline performance 5-10x faster.
For more details please check our [github](https://github.com/microsoft/rho).