metadata
language: en
tags:
- fill-mask
Environmental Impact (CODE CARBON DEFAULT)
Metric | Value |
---|---|
Duration (in seconds) | [More Information Needed] |
Emissions (Co2eq in kg) | [More Information Needed] |
CPU power (W) | [NO CPU] |
GPU power (W) | [No GPU] |
RAM power (W) | [More Information Needed] |
CPU energy (kWh) | [No CPU] |
GPU energy (kWh) | [No GPU] |
RAM energy (kWh) | [More Information Needed] |
Consumed energy (kWh) | [More Information Needed] |
Country name | [More Information Needed] |
Cloud provider | [No Cloud] |
Cloud region | [No Cloud] |
CPU count | [No CPU] |
CPU model | [No CPU] |
GPU count | [No GPU] |
GPU model | [No GPU] |
Environmental Impact (for one core)
Metric | Value |
---|---|
CPU energy (kWh) | [No CPU] |
Emissions (Co2eq in kg) | [More Information Needed] |
Note
30 April 2024
My Config
Config | Value |
---|---|
checkpoint | albert-base-v2 |
model_name | BERTrand_bs64_lr6 |
sequence_length | 400 |
num_epoch | 12 |
learning_rate | 5e-06 |
batch_size | 64 |
weight_decay | 0.0 |
warm_up_prop | 0 |
drop_out_prob | 0.1 |
packing_length | 100 |
train_test_split | 0.2 |
num_steps | 3147 |
Training and Testing steps
Epoch | Train Loss | Test Loss |
---|---|---|
0.0 | 15.574399 | 15.096123 |
0.5 | 9.594637 | 8.148669 |
1.0 | 7.853338 | 8.074202 |
1.5 | 7.905947 | 7.939530 |
2.0 | 7.834033 | 7.833388 |
2.5 | 7.720610 | 7.871610 |
3.0 | 7.495963 | 7.976839 |
3.5 | 7.330389 | 7.752517 |
4.0 | 7.214343 | 7.848690 |
4.5 | 7.346055 | 7.724831 |
5.0 | 7.110836 | 7.715771 |
5.5 | 7.125741 | 7.595748 |
6.0 | 7.127250 | 7.659738 |
6.5 | 7.239036 | 7.671448 |
7.0 | 7.073343 | 7.705375 |
7.5 | 7.070813 | 7.589307 |
8.0 | 7.124647 | 7.582091 |
8.5 | 7.166616 | 7.539913 |
9.0 | 7.092505 | 7.611073 |
9.5 | 7.048057 | 7.625665 |
10.0 | 7.101367 | 7.564788 |
10.5 | 7.108332 | 7.602001 |
11.0 | 7.179604 | 7.554187 |
11.5 | 7.028062 | 7.575663 |