File size: 2,267 Bytes
41891e6
af6cf4e
 
 
41891e6
 
af6cf4e
41891e6
af6cf4e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41891e6
af6cf4e
41891e6
af6cf4e
 
 
 
41891e6
af6cf4e
41891e6
af6cf4e
41891e6
af6cf4e
41891e6
af6cf4e
 
 
 
 
 
 
 
 
 
 
 
 
 
41891e6
af6cf4e
41891e6
 
 
 
 
 
af6cf4e
 
 
 
ce66606
68f3682
4c1471f
d759656
97c481f
2373049
81eb803
04831d7
5c2418e
64d664f
b2da19e
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
---
language: en
tags:
- fill-mask
---

## Environmental Impact (CODE CARBON DEFAULT)

| Metric                   | Value                           |
|--------------------------|---------------------------------|
| Duration (in seconds)    | [More Information Needed]  |
| Emissions (Co2eq in kg)  | [More Information Needed] |
| CPU power (W)            | [NO CPU]  |
| GPU power (W)            | [No GPU]  |
| RAM power (W)            | [More Information Needed]  |
| CPU energy (kWh)         | [No CPU]  |
| GPU energy (kWh)         | [No GPU]  |
| RAM energy (kWh)         | [More Information Needed]  |
| Consumed energy (kWh)    | [More Information Needed]  |
| Country name             | [More Information Needed]  |
| Cloud provider           | [No Cloud]  |
| Cloud region             | [No Cloud]  |
| CPU count                | [No CPU]  |
| CPU model                | [No CPU]  |
| GPU count                | [No GPU]  |
| GPU model                | [No GPU]  |

## Environmental Impact (for one core)

| Metric                   | Value                           |
|--------------------------|---------------------------------|
| CPU energy (kWh)         | [No CPU]  |
| Emissions (Co2eq in kg)  | [More Information Needed] |

## Note

30 April 2024

## My Config

| Config                   | Value           |
|--------------------------|-----------------|
| checkpoint               | albert-base-v2  |
| model_name               | BERTrand_bs32_lr6 |
| sequence_length          | 400  |
| num_epoch                | 12  |
| learning_rate            | 5e-06  |
| batch_size               | 32  |
| weight_decay             | 0.0  |
| warm_up_prop             | 0  |
| drop_out_prob            | 0.1 |
| packing_length           | 100 |
| train_test_split         | 0.2 |
| num_steps                | 6318 |

## Training and Testing steps






 
Epoch | Train Loss | Test Loss
---|---|---
| 0.0 | 15.603048 | 15.109937 |
| 0.5 | 8.715844 | 8.071290 |
| 1.0 | 7.608879 | 8.114126 |
| 1.5 | 7.407612 | 7.914163 |
| 2.0 | 7.323461 | 7.774658 |
| 2.5 | 7.248362 | 7.696718 |
| 3.0 | 7.101276 | 7.856242 |
| 3.5 | 7.134161 | 7.617901 |
| 4.0 | 7.105548 | 7.837306 |
| 4.5 | 7.221799 | 7.653854 |
| 5.0 | 7.047156 | 7.659136 |
| 5.5 | 7.080983 | 7.554190 |