Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,7 @@ tags:
|
|
9 |
- longformer
|
10 |
- robertalex
|
11 |
- spanish
|
|
|
12 |
|
13 |
---
|
14 |
|
@@ -23,6 +24,16 @@ tags:
|
|
23 |
|
24 |
This model was made following the research done by [Iz Beltagy and Matthew E. Peters and Arman Cohan](https://arxiv.org/abs/2004.05150).
|
25 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
## Citation
|
27 |
If you want to cite this model you can use this:
|
28 |
|
|
|
9 |
- longformer
|
10 |
- robertalex
|
11 |
- spanish
|
12 |
+
- legal
|
13 |
|
14 |
---
|
15 |
|
|
|
24 |
|
25 |
This model was made following the research done by [Iz Beltagy and Matthew E. Peters and Arman Cohan](https://arxiv.org/abs/2004.05150).
|
26 |
|
27 |
+
## Model (base checkpoint)
|
28 |
+
[RoBERTalex](https://huggingface.co/PlanTL-GOB-ES/RoBERTalex?)
|
29 |
+
There are few models trained for the Spanish language. Some of the models have been trained with a low resource, unclean corpora. The ones derived from the Spanish National Plan for Language Technologies are proficient in solving several tasks and have been trained using large-scale clean corpora. However, the Spanish Legal domain language could be thought of as an independent language on its own. We, therefore, created a Spanish Legal model from scratch trained exclusively on legal corpora.
|
30 |
+
|
31 |
+
## Dataset
|
32 |
+
[Spanish Legal Domain Corpora](https://zenodo.org/record/5495529)
|
33 |
+
A collection of corpora of Spanish legal domain.
|
34 |
+
|
35 |
+
More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es
|
36 |
+
|
37 |
## Citation
|
38 |
If you want to cite this model you can use this:
|
39 |
|