Narrativa commited on
Commit
9a51c41
·
1 Parent(s): 5c237a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -9,6 +9,7 @@ tags:
9
  - longformer
10
  - robertalex
11
  - spanish
 
12
 
13
  ---
14
 
@@ -23,6 +24,16 @@ tags:
23
 
24
  This model was made following the research done by [Iz Beltagy and Matthew E. Peters and Arman Cohan](https://arxiv.org/abs/2004.05150).
25
 
 
 
 
 
 
 
 
 
 
 
26
  ## Citation
27
  If you want to cite this model you can use this:
28
 
 
9
  - longformer
10
  - robertalex
11
  - spanish
12
+ - legal
13
 
14
  ---
15
 
 
24
 
25
  This model was made following the research done by [Iz Beltagy and Matthew E. Peters and Arman Cohan](https://arxiv.org/abs/2004.05150).
26
 
27
+ ## Model (base checkpoint)
28
+ [RoBERTalex](https://huggingface.co/PlanTL-GOB-ES/RoBERTalex?)
29
+ There are few models trained for the Spanish language. Some of the models have been trained with a low resource, unclean corpora. The ones derived from the Spanish National Plan for Language Technologies are proficient in solving several tasks and have been trained using large-scale clean corpora. However, the Spanish Legal domain language could be thought of as an independent language on its own. We, therefore, created a Spanish Legal model from scratch trained exclusively on legal corpora.
30
+
31
+ ## Dataset
32
+ [Spanish Legal Domain Corpora](https://zenodo.org/record/5495529)
33
+ A collection of corpora of Spanish legal domain.
34
+
35
+ More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es
36
+
37
  ## Citation
38
  If you want to cite this model you can use this:
39