Update README.md
Browse files
README.md
CHANGED
@@ -10,13 +10,13 @@ license: mit
|
|
10 |
|
11 |
**Chuxin-1.6B-1M**是基于Chuxin-1.6B模型在1M窗口下训练后的结果,大海捞针实验显示其具有非常强的上下文检索能力。
|
12 |
|
13 |
-
如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://
|
14 |
|
15 |
**Chuxin-1.6B-Base** is a model with 1.6 billion parameters. Chuxin-1.6B is built entirely on open-source data. After being trained with large-scale data, Chuxin has very competitive capabilities in various downstream tasks.
|
16 |
|
17 |
**Chuxin-1.6B-1M** is the result of training the Chuxin-1.6B model with a 1M windows. Experiments such as searching for a needle in a haystack demonstrate its strong contextual retrieval abilities.
|
18 |
|
19 |
-
If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://
|
20 |
<br>
|
21 |
|
22 |
## 快速使用(Quickstart)
|
@@ -81,7 +81,7 @@ If you find our work helpful, feel free to give us a cite.
|
|
81 |
@article{chuxin,
|
82 |
title={CHUXIN: 1.6B TECHNICAL REPORT},
|
83 |
author={Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu},
|
84 |
-
journal={arXiv preprint arXiv:
|
85 |
year={2024}
|
86 |
}
|
87 |
```
|
|
|
10 |
|
11 |
**Chuxin-1.6B-1M**是基于Chuxin-1.6B模型在1M窗口下训练后的结果,大海捞针实验显示其具有非常强的上下文检索能力。
|
12 |
|
13 |
+
如果您想了解更多关于Chuxin-1.6B开源模型的细节,我们建议您参阅我们的[技术报告](https://arxiv.org/pdf/2405.04828)
|
14 |
|
15 |
**Chuxin-1.6B-Base** is a model with 1.6 billion parameters. Chuxin-1.6B is built entirely on open-source data. After being trained with large-scale data, Chuxin has very competitive capabilities in various downstream tasks.
|
16 |
|
17 |
**Chuxin-1.6B-1M** is the result of training the Chuxin-1.6B model with a 1M windows. Experiments such as searching for a needle in a haystack demonstrate its strong contextual retrieval abilities.
|
18 |
|
19 |
+
If you would like to learn more about the Chuxin-1.6B open-source model, we suggest you refer to our [technical report](https://arxiv.org/pdf/2405.04828).
|
20 |
<br>
|
21 |
|
22 |
## 快速使用(Quickstart)
|
|
|
81 |
@article{chuxin,
|
82 |
title={CHUXIN: 1.6B TECHNICAL REPORT},
|
83 |
author={Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu},
|
84 |
+
journal={arXiv preprint arXiv:2405.04828},
|
85 |
year={2024}
|
86 |
}
|
87 |
```
|