alvp commited on
Commit
fa738bf
·
1 Parent(s): c8f41ab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -0
README.md CHANGED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: es
3
+ license: CC-BY 4.0
4
+ tags:
5
+ - spanish
6
+ - bert
7
+ pipeline_tag: fill-mask
8
+ widget:
9
+ - text: "Fui a la librería a comprar un <mask>."
10
+ ---
11
+
12
+ # ALBERTI
13
+
14
+ ALBERTI is a BERT-based multilingual model for poetry. This model has been further trained with the PULPO corpus using [Flax](https://github.com/google/flax), including training scripts.
15
+
16
+ This is part of the
17
+ [Flax/Jax Community Week](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104), organised by [HuggingFace](https://huggingface.co/) and TPU usage sponsored by Google.
18
+
19
+ ## PULPO
20
+
21
+ PULPO, the Prodigious Unannounced Literary Poetry Corpus is a set of multilingual corpora of verses and stanzas with over 95M words.
22
+
23
+
24
+ ## Team members
25
+
26
+ - Álvaro Pérez ([alvp](https://huggingface.co/alvp))
27
+ - Javier de la Rosa ([versae](https://huggingface.co/versae))
28
+ - Aitor Díaz ([aitordiaz](https://huggingface.co/aitordiaz))
29
+
30
+ ## Useful links
31
+
32
+ - [Community Week timeline](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/7104#summary-timeline-calendar-6)
33
+ - [Community Week README](https://github.com/huggingface/transformers/blob/master/examples/research_projects/jax-projects/README.md)
34
+ - [Community Week thread](https://discuss.huggingface.co/t/bertin-pretrain-roberta-large-from-scratch-in-spanish/7125)
35
+ - [Community Week channel](https://discord.com/channels/858019234139602994/859113060068229190)
36
+ - [Masked Language Modelling example scripts](https://github.com/huggingface/transformers/tree/master/examples/flax/language-modeling)
37
+ - [Model Repository](https://huggingface.co/flax-community/alberti-bert-base-multilingual-cased/)