add model
Browse files- README.md +15 -6
- pytorch_model.bin +1 -1
- runs/Dec04_08-10-17_0555d0489403/1638605423.6123514/events.out.tfevents.1638605423.0555d0489403.772.1 +3 -0
- runs/Dec04_08-10-17_0555d0489403/events.out.tfevents.1638605423.0555d0489403.772.0 +3 -0
- runs/Dec04_08-15-21_0555d0489403/1638605727.360147/events.out.tfevents.1638605727.0555d0489403.941.1 +3 -0
- runs/Dec04_08-15-21_0555d0489403/events.out.tfevents.1638605727.0555d0489403.941.0 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -13,7 +13,7 @@ model_index:
|
|
13 |
metric:
|
14 |
name: Bleu
|
15 |
type: bleu
|
16 |
-
value:
|
17 |
---
|
18 |
|
19 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -23,9 +23,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
23 |
|
24 |
This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ja-en](https://huggingface.co/Helsinki-NLP/opus-mt-ja-en) on an unkown dataset.
|
25 |
It achieves the following results on the evaluation set:
|
26 |
-
- Loss: 0.
|
27 |
-
- Bleu:
|
28 |
-
- Gen Len: 27.
|
29 |
|
30 |
## Model description
|
31 |
|
@@ -50,14 +50,23 @@ The following hyperparameters were used during training:
|
|
50 |
- seed: 42
|
51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
52 |
- lr_scheduler_type: linear
|
53 |
-
- num_epochs:
|
54 |
- mixed_precision_training: Native AMP
|
55 |
|
56 |
### Training results
|
57 |
|
58 |
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
59 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
|
60 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
61 |
|
62 |
|
63 |
### Framework versions
|
|
|
13 |
metric:
|
14 |
name: Bleu
|
15 |
type: bleu
|
16 |
+
value: 73.8646
|
17 |
---
|
18 |
|
19 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
23 |
|
24 |
This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ja-en](https://huggingface.co/Helsinki-NLP/opus-mt-ja-en) on an unkown dataset.
|
25 |
It achieves the following results on the evaluation set:
|
26 |
+
- Loss: 0.7520
|
27 |
+
- Bleu: 73.8646
|
28 |
+
- Gen Len: 27.0884
|
29 |
|
30 |
## Model description
|
31 |
|
|
|
50 |
- seed: 42
|
51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
52 |
- lr_scheduler_type: linear
|
53 |
+
- num_epochs: 10
|
54 |
- mixed_precision_training: Native AMP
|
55 |
|
56 |
### Training results
|
57 |
|
58 |
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
59 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
|
60 |
+
| 1.0512 | 1.0 | 748 | 0.8333 | 59.8234 | 27.905 |
|
61 |
+
| 0.6076 | 2.0 | 1496 | 0.7817 | 62.5606 | 26.1834 |
|
62 |
+
| 0.4174 | 3.0 | 2244 | 0.7817 | 64.8346 | 28.2918 |
|
63 |
+
| 0.2971 | 4.0 | 2992 | 0.7653 | 67.6013 | 27.2222 |
|
64 |
+
| 0.2172 | 5.0 | 3740 | 0.7295 | 69.4017 | 27.0174 |
|
65 |
+
| 0.1447 | 6.0 | 4488 | 0.7522 | 68.8355 | 28.2865 |
|
66 |
+
| 0.0953 | 7.0 | 5236 | 0.7596 | 71.4743 | 27.1861 |
|
67 |
+
| 0.0577 | 8.0 | 5984 | 0.7469 | 72.0684 | 26.921 |
|
68 |
+
| 0.04 | 9.0 | 6732 | 0.7526 | 73.2821 | 27.1365 |
|
69 |
+
| 0.0213 | 10.0 | 7480 | 0.7520 | 73.8646 | 27.0884 |
|
70 |
|
71 |
|
72 |
### Framework versions
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 301227653
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:847eb272efc818497ba602ecdb24358b9ad99ebb3c6240c6ce5c99b21448d9a5
|
3 |
size 301227653
|
runs/Dec04_08-10-17_0555d0489403/1638605423.6123514/events.out.tfevents.1638605423.0555d0489403.772.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fcd16674a405c3b433297eb8802a02ee3cd0de0c32b2ac029dc884de9db6ded7
|
3 |
+
size 4335
|
runs/Dec04_08-10-17_0555d0489403/events.out.tfevents.1638605423.0555d0489403.772.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:27a78cefdde17ea8d908d2671d0270a4f4a5614a0dc556805019c3587e4a1084
|
3 |
+
size 3731
|
runs/Dec04_08-15-21_0555d0489403/1638605727.360147/events.out.tfevents.1638605727.0555d0489403.941.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:36949264afaec059f6c852a4065e6b4d8e6aa5e2799867afb028cb4e2651f0a2
|
3 |
+
size 4335
|
runs/Dec04_08-15-21_0555d0489403/events.out.tfevents.1638605727.0555d0489403.941.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cb5fc0a869b4a2f680b1b5db5c549d8b123fcfbf8e390f8c4a9ca44ef62fac3d
|
3 |
+
size 13593
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 2735
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f461c7340b1f840e9bd44be195bb48df204ab9b762a6f81e9d5011a5b7007180
|
3 |
size 2735
|