Update README.md
README.md
CHANGED
@@ -11,8 +11,8 @@ datasets:
 # Cedille AI
 Cedille is a project to bring large language models to non-English languages.
 
-##
-Boris is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the
+## fr-boris
+Boris is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) codebase.
 
 Boris was trained on around 78B tokens of French text from the [C4](https://huggingface.co/datasets/c4) dataset. We started training from GPT-J, which has been trained on [The Pile](https://pile.eleuther.ai/). As a consequence the model still has good performance in English language. Boris makes use of the unmodified GPT-2 tokenizer.
 
@@ -21,7 +21,7 @@ Boris is named after the great French writer [Boris Vian](https://en.wikipedia.o
 # How do I test Cedille?
 For the time being, the easiest way to test the model is to use our [publicly accessible playground](https://en.cedille.ai/).
 
-Cedille is a relatively large model and running it in production can get expensive. Consider contacting us for API access.
+Cedille is a relatively large model and running it in production can get expensive. Consider contacting us for API access at info@coteries.com.
 
 # How do I cite Cedille?
 Thanks for citing our work in case you build on top of Cedille. For the time being, please reference our work like so: