SzegedAI
/

hubertusz-medium-wiki

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

ficsort commited on Oct 18, 2022

Commit

f09778d

·

1 Parent(s): 2519a5b

Update README.md

Files changed (1) hide show

README.md +18 -27

README.md CHANGED Viewed

@@ -1,43 +1,31 @@
 ---
 tags:
 - generated_from_keras_callback
 model-index:
 - name: hubert-medium-wiki
   results: []
 ---
-<!-- This model card has been generated automatically according to the information Keras had access to. You should
-probably proofread and complete it, then remove this comment. -->
 # hubert-medium-wiki
-This model was trained from scratch on an unknown dataset.
-It achieves the following results on the evaluation set:
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- optimizer: None
-- training_precision: float32
-### Training results
 ### Framework versions
@@ -45,3 +33,6 @@ The following hyperparameters were used during training:
 - TensorFlow 2.10.0
 - Datasets 2.4.0
 - Tokenizers 0.12.1

 ---
+language: hu
+license: apache-2.0
+datasets:
+- wikipedia
 tags:
 - generated_from_keras_callback
+- hubert
 model-index:
 - name: hubert-medium-wiki
   results: []
 ---
 # hubert-medium-wiki
+This model was trained from scratch on the Wikipedia subset of Hungarian Webcorpus 2.0 with MLM and SOP tasks.
+### Pre-Training Parameters:
+First phase:
+- Training steps: 500.000
+- Sequence length: 128
+- Batch size: 1024
+Second phase:
+- Training steps: 100.000
+- Sequence length: 512
+- Batch size: 384
 ### Framework versions
 - TensorFlow 2.10.0
 - Datasets 2.4.0
 - Tokenizers 0.12.1
+# Acknowledgement
+[![Artificial Intelligence - National Laboratory - Hungary](https://milab.tk.hu/uploads/images/milab_logo_en.png)](https://mi.nemzetilabor.hu/)