Update README.md
Browse files
README.md
CHANGED
@@ -35,8 +35,9 @@ base_model:
|
|
35 |
# Model Card for Teuken-7B-instruct-research-v0.4
|
36 |
|
37 |
|
38 |
-
Teuken-7B-instruct-research-v0.4 is an instruction-tuned 7B parameter multilingual large language model (LLM) pre-trained with 4T tokens within the research project OpenGPT-X.
|
39 |
-
[Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) is
|
|
|
40 |
|
41 |
|
42 |
### Model Description
|
@@ -52,12 +53,12 @@ Teuken-7B-instruct-research-v0.4 is an instruction-tuned 7B parameter multilingu
|
|
52 |
## Uses
|
53 |
|
54 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
55 |
-
[Teuken-7B-
|
56 |
-
Since the underlying base model is trained on all 24 EU languages, Teuken-7B-instruct-research-v0.4 is also intended for research use in these 24 languages. Teuken-7B-instruct-research-v0.4 is a fine-tuned variant with a special focus on German and English language.
|
57 |
|
58 |
## Disclaimer Toxic Content:
|
59 |
|
60 |
-
This Language Model (LLM) may generate content that is inappropriate, offensive, or harmful. While the dataset has been heavily filtered to minimize such outputs, the model may still produce text that is biased or toxic due to the large scale and diverse nature of the data.
|
61 |
|
62 |
|
63 |
### Out-of-Scope Use
|
@@ -70,7 +71,7 @@ The model is not intended for use in math and coding tasks.
|
|
70 |
|
71 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
72 |
|
73 |
-
Teuken-7B-instruct-research-v0.4 is an instruction-tuned version of [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) that is not completely free from biases and hallucinations.
|
74 |
|
75 |
## How to Get Started with the Model
|
76 |
|
|
|
35 |
# Model Card for Teuken-7B-instruct-research-v0.4
|
36 |
|
37 |
|
38 |
+
[Teuken-7B-instruct-research-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-instruct-research-v0.4) is an instruction-tuned 7B parameter multilingual large language model (LLM) pre-trained with 4T tokens within the research project OpenGPT-X.
|
39 |
+
The base model [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) is available on request.
|
40 |
+
|
41 |
|
42 |
|
43 |
### Model Description
|
|
|
53 |
## Uses
|
54 |
|
55 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
56 |
+
[Teuken-7B-instruct-research-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-instruct-research-v0.4) focuses on covering all 24 EU languages and therefore renders more stable results across these languages and better reflects European values in its answers than English-centric models. It is therefore specialized for use in multilingual tasks.
|
57 |
+
Since the underlying base model is trained on all 24 EU languages, Teuken-7B-instruct-research-v0.4 is also intended for research use in these 24 languages. [Teuken-7B-instruct-research-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-instruct-research-v0.4) is a fine-tuned variant with a special focus on German and English language.
|
58 |
|
59 |
## Disclaimer Toxic Content:
|
60 |
|
61 |
+
This Large Language Model (LLM) may generate content that is inappropriate, offensive, or harmful. While the dataset has been heavily filtered to minimize such outputs, the model may still produce text that is biased or toxic due to the large scale and diverse nature of the data.
|
62 |
|
63 |
|
64 |
### Out-of-Scope Use
|
|
|
71 |
|
72 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
73 |
|
74 |
+
[Teuken-7B-instruct-research-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-instruct-research-v0.4) is an instruction-tuned version of [Teuken-7B-base-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-base-v0.4) (base model is available on request) that is not completely free from biases and hallucinations.
|
75 |
|
76 |
## How to Get Started with the Model
|
77 |
|