BERTrand_bs64_lr6 / README.md
damgomz's picture
Upload README.md with huggingface_hub
063a0fc verified
|
raw
history blame
2.63 kB
metadata
language: en
tags:
  - fill-mask

Environmental Impact (CODE CARBON DEFAULT)

Metric Value
Duration (in seconds) [More Information Needed]
Emissions (Co2eq in kg) [More Information Needed]
CPU power (W) [NO CPU]
GPU power (W) [No GPU]
RAM power (W) [More Information Needed]
CPU energy (kWh) [No CPU]
GPU energy (kWh) [No GPU]
RAM energy (kWh) [More Information Needed]
Consumed energy (kWh) [More Information Needed]
Country name [More Information Needed]
Cloud provider [No Cloud]
Cloud region [No Cloud]
CPU count [No CPU]
CPU model [No CPU]
GPU count [No GPU]
GPU model [No GPU]

Environmental Impact (for one core)

Metric Value
CPU energy (kWh) [No CPU]
Emissions (Co2eq in kg) [More Information Needed]

Note

30 April 2024

My Config

Config Value
checkpoint albert-base-v2
model_name BERTrand_bs64_lr6
sequence_length 400
num_epoch 12
learning_rate 5e-06
batch_size 64
weight_decay 0.0
warm_up_prop 0
drop_out_prob 0.1
packing_length 100
train_test_split 0.2
num_steps 3147

Training and Testing steps

Epoch Train Loss Test Loss
0.0 15.574399 15.096123
0.5 9.594637 8.148669
1.0 7.853338 8.074202
1.5 7.905947 7.939530
2.0 7.834033 7.833388
2.5 7.720610 7.871610
3.0 7.495963 7.976839
3.5 7.330389 7.752517
4.0 7.214343 7.848690
4.5 7.346055 7.724831
5.0 7.110836 7.715771
5.5 7.125741 7.595748
6.0 7.127250 7.659738
6.5 7.239036 7.671448
7.0 7.073343 7.705375
7.5 7.070813 7.589307
8.0 7.124647 7.582091
8.5 7.166616 7.539913
9.0 7.092505 7.611073
9.5 7.048057 7.625665
10.0 7.101367 7.564788
10.5 7.108332 7.602001
11.0 7.179604 7.554187
11.5 7.028062 7.575663