Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ model-index:
|
|
18 |
metrics:
|
19 |
- name: bleu
|
20 |
type: bleu-4
|
21 |
-
value:
|
22 |
- name: rouge
|
23 |
type: rouge-l
|
24 |
value: 21.84
|
@@ -26,7 +26,7 @@ model-index:
|
|
26 |
|
27 |
# Llama-3.1-8B-Instruct-Galician
|
28 |
|
29 |
-
This model is a
|
30 |
|
31 |
## Model Details
|
32 |
|
@@ -36,21 +36,16 @@ This model is a fine-tuned version of [models/Meta-Llama-3.1-8B-Instruct](https:
|
|
36 |
|
37 |
|
38 |
|
39 |
-
- **Developed by:** [
|
40 |
-
- **Funded by [optional]:** [More Information Needed]
|
41 |
-
- **Shared by [optional]:** [More Information Needed]
|
42 |
- **Model type:** [More Information Needed]
|
43 |
-
- **Language(s) (NLP):**
|
44 |
-
- **License:**
|
45 |
-
- **Finetuned from model
|
46 |
|
47 |
-
### Model Sources
|
48 |
|
49 |
-
|
50 |
-
|
51 |
-
- **Repository:** [More Information Needed]
|
52 |
-
- **Paper [optional]:** [More Information Needed]
|
53 |
-
- **Demo [optional]:** [More Information Needed]
|
54 |
|
55 |
## Uses
|
56 |
|
@@ -173,24 +168,16 @@ The following hyperparameters were used during training:
|
|
173 |
#### Summary
|
174 |
|
175 |
|
176 |
-
|
177 |
-
## Model Examination [optional]
|
178 |
-
|
179 |
-
<!-- Relevant interpretability work for the model goes here -->
|
180 |
-
|
181 |
-
[More Information Needed]
|
182 |
-
|
183 |
## Environmental Impact
|
184 |
|
185 |
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
186 |
|
187 |
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
188 |
|
189 |
-
- **Hardware Type:**
|
190 |
-
- **Hours used:**
|
191 |
-
- **Cloud Provider:**
|
192 |
-
- **
|
193 |
-
- **Carbon Emitted:** [More Information Needed]
|
194 |
|
195 |
## Technical Specifications [optional]
|
196 |
|
|
|
18 |
metrics:
|
19 |
- name: bleu
|
20 |
type: bleu-4
|
21 |
+
value: 23.13
|
22 |
- name: rouge
|
23 |
type: rouge-l
|
24 |
value: 21.84
|
|
|
26 |
|
27 |
# Llama-3.1-8B-Instruct-Galician
|
28 |
|
29 |
+
This model is a continued pretraining version of [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) on the [CorpusNós](https://zenodo.org/records/11655219) dataset.
|
30 |
|
31 |
## Model Details
|
32 |
|
|
|
36 |
|
37 |
|
38 |
|
39 |
+
- **Developed by:** [UDC Information Retrieval Lab (IRLab)](https://huggingface.co/irlab-udc)
|
|
|
|
|
40 |
- **Model type:** [More Information Needed]
|
41 |
+
- **Language(s) (NLP):** Multilingual, adapted to Galician
|
42 |
+
- **License:** llama3.1
|
43 |
+
- **Finetuned from model:** [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
|
44 |
|
45 |
+
### Model Sources
|
46 |
|
47 |
+
- **Repository:** [Adapting Large Language Models for Underrepresented Languages](https://gitlab.irlab.org/eliseo.bao/xovetic-llms-underrepresented-languages)
|
48 |
+
- **Paper:** _Coming soon_
|
|
|
|
|
|
|
49 |
|
50 |
## Uses
|
51 |
|
|
|
168 |
#### Summary
|
169 |
|
170 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
171 |
## Environmental Impact
|
172 |
|
173 |
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
174 |
|
175 |
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
176 |
|
177 |
+
- **Hardware Type:** 4x NVIDIA A100 SXM4 80 GB (TDP of 400W)
|
178 |
+
- **Hours used:** 60
|
179 |
+
- **Cloud Provider:** Private infrastructure
|
180 |
+
- **Carbon Emitted:** 10.37 kgCO$_2$eq
|
|
|
181 |
|
182 |
## Technical Specifications [optional]
|
183 |
|