AI-Sweden-Models/gpt-sw3-6.7b-v2-translator


timpal0l committed on Apr 2, 2024

Commit 81fdab0 · verified · 1 Parent(s): acad2d2

Update README.md

Files changed (1)

README.md +1 -1

@@ -72,4 +72,4 @@ print(response[0]["generated_text"].split("<s>Bot: ")[-1])
 ```
 ## Training & Data:
-The training was done on 1 NVIDIA DGX using DeepSpeed ZeRO 3 for three epochs on roughly 4GB of carefully selected translation data.
+The training was done on 1 NVIDIA DGX using DeepSpeed ZeRO 3 for three epochs on roughly 4GB of carefully selected translation data. It is a full finetune of all of the model parameters.
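
For readers unfamiliar with the setup named in the changed line, the following is a minimal sketch of what a DeepSpeed ZeRO stage 3 configuration could look like. Only the ZeRO stage, the three epochs, and the fact that all parameters are trained come from the README; the precision, batch size, and gradient accumulation values are illustrative assumptions, and `render_config` is a hypothetical helper, not part of this repository.

```python
import json

# Illustrative DeepSpeed config. Only "stage": 3 reflects the README;
# every other value below is an assumption made for this sketch.
ds_config = {
    "zero_optimization": {
        # ZeRO stage 3 partitions parameters, gradients, and optimizer
        # state across devices, which is what makes a full finetune of
        # a multi-billion-parameter model fit in memory.
        "stage": 3,
    },
    "bf16": {"enabled": True},             # assumed precision
    "train_micro_batch_size_per_gpu": 1,   # assumed
    "gradient_accumulation_steps": 8,      # assumed
}

NUM_EPOCHS = 3  # stated in the README


def render_config(config: dict) -> str:
    """Serialize the config dict to the JSON text DeepSpeed reads from a file."""
    return json.dumps(config, indent=2, sort_keys=True)


print(render_config(ds_config))
```

A full finetune, as opposed to an adapter-based method, leaves every model parameter trainable, so no parameter-freezing step appears in the sketch.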