Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,39 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# ERNIE-2.0-large
|
2 |
+
|
3 |
+
## Introduction
|
4 |
+
ERNIE-health is a Chinese biomedical language model pre-trained from in-domain text of de-identified online doctor-patient dialogues, electronic medical records, and textbooks.
|
5 |
+
|
6 |
+
More detail:
|
7 |
+
https://github.com/PaddlePaddle/Research/tree/master/KG/eHealth
|
8 |
+
https://github.com/PaddlePaddle/PaddleNLP/tree/develop/model_zoo/ernie-health
|
9 |
+
https://arxiv.org/pdf/2110.07244.pdf
|
10 |
+
|
11 |
+
## Released Model Info
|
12 |
+
|
13 |
+
|Model Name|Language|Model Structure|
|
14 |
+
|:---:|:---:|:---:|
|
15 |
+
|ernie-health-zh| Chinese |Layer:12, Hidden:768, Heads:12|
|
16 |
+
|
17 |
+
This released pytorch model is converted from the officially released PaddlePaddle ERNIE model and
|
18 |
+
a series of experiments have been conducted to check the accuracy of the conversion.
|
19 |
+
|
20 |
+
- Official PaddlePaddle ERNIE repo:https://github.com/PaddlePaddle/Research/tree/master/KG/eHealth
|
21 |
+
- Pytorch Conversion repo: https://github.com/nghuyong/ERNIE-Pytorch
|
22 |
+
|
23 |
+
## How to use
|
24 |
+
```Python
|
25 |
+
from transformers import AutoTokenizer, AutoModel
|
26 |
+
tokenizer = AutoTokenizer.from_pretrained("nghuyong/ernie-health-zh")
|
27 |
+
model = AutoModel.from_pretrained("nghuyong/ernie-health-zh")
|
28 |
+
```
|
29 |
+
|
30 |
+
## Citation
|
31 |
+
|
32 |
+
```bibtex
|
33 |
+
@article{wang2021building,
|
34 |
+
title={Building Chinese Biomedical Language Models via Multi-Level Text Discrimination},
|
35 |
+
author={Wang, Quan and Dai, Songtai and Xu, Benfeng and Lyu, Yajuan and Zhu, Yong and Wu, Hua and Wang, Haifeng},
|
36 |
+
journal={arXiv preprint arXiv:2110.07244},
|
37 |
+
year={2021}
|
38 |
+
}
|
39 |
+
```
|