Cedille commited on
Commit
cdc598a
·
1 Parent(s): 78e44d1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -11,8 +11,8 @@ datasets:
11
  # Cedille AI
12
  Cedille is a project to bring large language models to non-English languages.
13
 
14
- ## Boris
15
- Boris is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the (mesh-transformer-jax)[https://github.com/kingoflolz/mesh-transformer-jax] codebase.
16
 
17
  Boris was trained on around 78B tokens of French text from the [C4](https://huggingface.co/datasets/c4) dataset. We started training from GPT-J, which has been trained on [The Pile](https://pile.eleuther.ai/). As a consequence the model still has good performance in English language. Boris makes use of the unmodified GPT-2 tokenizer.
18
 
@@ -21,7 +21,7 @@ Boris is named after the great French writer [Boris Vian](https://en.wikipedia.o
21
  # How do I test Cedille?
22
  For the time being, the easiest way to test the model is to use our [publicly accessible playground](https://en.cedille.ai/).
23
 
24
- Cedille is a relatively large model and running it in production can get expensive. Consider contacting us for API access.
25
 
26
  # How do I cite Cedille?
27
  Thanks for citing our work in case you build on top of Cedille. For the time being, please reference our work like so:
 
11
  # Cedille AI
12
  Cedille is a project to bring large language models to non-English languages.
13
 
14
+ ## fr-boris
15
+ Boris is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) codebase.
16
 
17
  Boris was trained on around 78B tokens of French text from the [C4](https://huggingface.co/datasets/c4) dataset. We started training from GPT-J, which has been trained on [The Pile](https://pile.eleuther.ai/). As a consequence the model still has good performance in English language. Boris makes use of the unmodified GPT-2 tokenizer.
18
 
 
21
  # How do I test Cedille?
22
  For the time being, the easiest way to test the model is to use our [publicly accessible playground](https://en.cedille.ai/).
23
 
24
+ Cedille is a relatively large model and running it in production can get expensive. Consider contacting us for API access at info@coteries.com.
25
 
26
  # How do I cite Cedille?
27
  Thanks for citing our work in case you build on top of Cedille. For the time being, please reference our work like so: