Narrativa/legal-longformer-base-4096-spanish


Narrativa committed on Nov 10, 2022

Commit 9a51c41 · 1 Parent(s): 5c237a7

Update README.md

Files changed (1)

README.md +11 -0

README.md CHANGED

@@ -9,6 +9,7 @@ tags:
 - longformer
 - robertalex
 - spanish
+- legal
 ---
@@ -23,6 +24,16 @@ tags:
 This model was made following the research done by [Iz Beltagy and Matthew E. Peters and Arman Cohan](https://arxiv.org/abs/2004.05150).
+## Model (base checkpoint)
+[RoBERTalex](https://huggingface.co/PlanTL-GOB-ES/RoBERTalex?)
+There are few models trained for the Spanish language. Some of them have been trained on low-resource, unclean corpora. The ones derived from the Spanish National Plan for Language Technologies are proficient at several tasks and were trained on large-scale clean corpora. However, Spanish legal language could be thought of as an independent language of its own. We therefore created a Spanish legal model from scratch, trained exclusively on legal corpora.
+## Dataset
+[Spanish Legal Domain Corpora](https://zenodo.org/record/5495529)
+A collection of corpora from the Spanish legal domain.
+More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es
 ## Citation
 If you want to cite this model you can use this: