MohamedAAK committed on
Commit
88a4a55
·
verified ·
1 Parent(s): 9c14104

Upload model

Files changed (3)
  1. README.md +4 -807
  2. config.json +1 -1
  3. tf_model.h5 +2 -2
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: distilgpt2
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
@@ -13,10 +13,9 @@ probably proofread and complete it, then remove this comment. -->
13
 
14
  # my_awesome_power_model_llm
15
 
16
- This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.0462
19
- - Epoch: 799
20
 
21
  ## Model description
22
 
@@ -35,813 +34,11 @@ More information needed
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
- - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 4e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
39
  - training_precision: float32
40
 
41
  ### Training results
42
 
43
- | Train Loss | Epoch |
44
- |:----------:|:-----:|
45
- | 9.5717 | 0 |
46
- | 3.5656 | 1 |
47
- | 3.2429 | 2 |
48
- | 3.0974 | 3 |
49
- | 3.0134 | 4 |
50
- | 2.9362 | 5 |
51
- | 2.8612 | 6 |
52
- | 2.7933 | 7 |
53
- | 2.7282 | 8 |
54
- | 2.6593 | 9 |
55
- | 2.5957 | 10 |
56
- | 2.5425 | 11 |
57
- | 2.4823 | 12 |
58
- | 2.4424 | 13 |
59
- | 2.3971 | 14 |
60
- | 2.3583 | 15 |
61
- | 2.3261 | 16 |
62
- | 2.2981 | 17 |
63
- | 2.2790 | 18 |
64
- | 2.2477 | 19 |
65
- | 2.2211 | 20 |
66
- | 2.2066 | 21 |
67
- | 2.1871 | 22 |
68
- | 2.1692 | 23 |
69
- | 2.1490 | 24 |
70
- | 2.1287 | 25 |
71
- | 2.1130 | 26 |
72
- | 2.0988 | 27 |
73
- | 2.0845 | 28 |
74
- | 2.0666 | 29 |
75
- | 2.0548 | 30 |
76
- | 2.0401 | 31 |
77
- | 2.0280 | 32 |
78
- | 2.0142 | 33 |
79
- | 1.9991 | 34 |
80
- | 1.9925 | 35 |
81
- | 1.9782 | 36 |
82
- | 1.9664 | 37 |
83
- | 1.9551 | 38 |
84
- | 1.9436 | 39 |
85
- | 1.9309 | 40 |
86
- | 1.9242 | 41 |
87
- | 1.9167 | 42 |
88
- | 1.9013 | 43 |
89
- | 1.8976 | 44 |
90
- | 1.8889 | 45 |
91
- | 1.8784 | 46 |
92
- | 1.8804 | 47 |
93
- | 1.8625 | 48 |
94
- | 1.8535 | 49 |
95
- | 1.8388 | 50 |
96
- | 1.8344 | 51 |
97
- | 1.8270 | 52 |
98
- | 1.8190 | 53 |
99
- | 1.8076 | 54 |
100
- | 1.8033 | 55 |
101
- | 1.7922 | 56 |
102
- | 1.7870 | 57 |
103
- | 1.7789 | 58 |
104
- | 1.7718 | 59 |
105
- | 1.7670 | 60 |
106
- | 1.7618 | 61 |
107
- | 1.7540 | 62 |
108
- | 1.7457 | 63 |
109
- | 1.7373 | 64 |
110
- | 1.7327 | 65 |
111
- | 1.7277 | 66 |
112
- | 1.7150 | 67 |
113
- | 1.7159 | 68 |
114
- | 1.7070 | 69 |
115
- | 1.6950 | 70 |
116
- | 1.6916 | 71 |
117
- | 1.6859 | 72 |
118
- | 1.6870 | 73 |
119
- | 1.6765 | 74 |
120
- | 1.6667 | 75 |
121
- | 1.6607 | 76 |
122
- | 1.6613 | 77 |
123
- | 1.6504 | 78 |
124
- | 1.6427 | 79 |
125
- | 1.6448 | 80 |
126
- | 1.6340 | 81 |
127
- | 1.6276 | 82 |
128
- | 1.6226 | 83 |
129
- | 1.6179 | 84 |
130
- | 1.6127 | 85 |
131
- | 1.6113 | 86 |
132
- | 1.5980 | 87 |
133
- | 1.5974 | 88 |
134
- | 1.5941 | 89 |
135
- | 1.5848 | 90 |
136
- | 1.5791 | 91 |
137
- | 1.5778 | 92 |
138
- | 1.5682 | 93 |
139
- | 1.5674 | 94 |
140
- | 1.5585 | 95 |
141
- | 1.5520 | 96 |
142
- | 1.5505 | 97 |
143
- | 1.5435 | 98 |
144
- | 1.5353 | 99 |
145
- | 1.5318 | 100 |
146
- | 1.5274 | 101 |
147
- | 1.5269 | 102 |
148
- | 1.5203 | 103 |
149
- | 1.5124 | 104 |
150
- | 1.5050 | 105 |
151
- | 1.5036 | 106 |
152
- | 1.4964 | 107 |
153
- | 1.4901 | 108 |
154
- | 1.4862 | 109 |
155
- | 1.4776 | 110 |
156
- | 1.4767 | 111 |
157
- | 1.4759 | 112 |
158
- | 1.4726 | 113 |
159
- | 1.4662 | 114 |
160
- | 1.4569 | 115 |
161
- | 1.4525 | 116 |
162
- | 1.4482 | 117 |
163
- | 1.4427 | 118 |
164
- | 1.4377 | 119 |
165
- | 1.4318 | 120 |
166
- | 1.4276 | 121 |
167
- | 1.4188 | 122 |
168
- | 1.4134 | 123 |
169
- | 1.4094 | 124 |
170
- | 1.4116 | 125 |
171
- | 1.3983 | 126 |
172
- | 1.3956 | 127 |
173
- | 1.3873 | 128 |
174
- | 1.3889 | 129 |
175
- | 1.3852 | 130 |
176
- | 1.3784 | 131 |
177
- | 1.3740 | 132 |
178
- | 1.3629 | 133 |
179
- | 1.3607 | 134 |
180
- | 1.3535 | 135 |
181
- | 1.3522 | 136 |
182
- | 1.3462 | 137 |
183
- | 1.3429 | 138 |
184
- | 1.3338 | 139 |
185
- | 1.3368 | 140 |
186
- | 1.3291 | 141 |
187
- | 1.3215 | 142 |
188
- | 1.3162 | 143 |
189
- | 1.3107 | 144 |
190
- | 1.3075 | 145 |
191
- | 1.2983 | 146 |
192
- | 1.2984 | 147 |
193
- | 1.2969 | 148 |
194
- | 1.2903 | 149 |
195
- | 1.2852 | 150 |
196
- | 1.2766 | 151 |
197
- | 1.2753 | 152 |
198
- | 1.2674 | 153 |
199
- | 1.2632 | 154 |
200
- | 1.2530 | 155 |
201
- | 1.2551 | 156 |
202
- | 1.2459 | 157 |
203
- | 1.2459 | 158 |
204
- | 1.2370 | 159 |
205
- | 1.2323 | 160 |
206
- | 1.2270 | 161 |
207
- | 1.2246 | 162 |
208
- | 1.2169 | 163 |
209
- | 1.2128 | 164 |
210
- | 1.2099 | 165 |
211
- | 1.2003 | 166 |
212
- | 1.2036 | 167 |
213
- | 1.1932 | 168 |
214
- | 1.1891 | 169 |
215
- | 1.1826 | 170 |
216
- | 1.1820 | 171 |
217
- | 1.1744 | 172 |
218
- | 1.1665 | 173 |
219
- | 1.1637 | 174 |
220
- | 1.1552 | 175 |
221
- | 1.1534 | 176 |
222
- | 1.1467 | 177 |
223
- | 1.1452 | 178 |
224
- | 1.1364 | 179 |
225
- | 1.1357 | 180 |
226
- | 1.1290 | 181 |
227
- | 1.1228 | 182 |
228
- | 1.1197 | 183 |
229
- | 1.1140 | 184 |
230
- | 1.1075 | 185 |
231
- | 1.1023 | 186 |
232
- | 1.0935 | 187 |
233
- | 1.0927 | 188 |
234
- | 1.0849 | 189 |
235
- | 1.0831 | 190 |
236
- | 1.0787 | 191 |
237
- | 1.0783 | 192 |
238
- | 1.0618 | 193 |
239
- | 1.0670 | 194 |
240
- | 1.0580 | 195 |
241
- | 1.0495 | 196 |
242
- | 1.0449 | 197 |
243
- | 1.0469 | 198 |
244
- | 1.0389 | 199 |
245
- | 1.0333 | 200 |
246
- | 1.0271 | 201 |
247
- | 1.0230 | 202 |
248
- | 1.0182 | 203 |
249
- | 1.0112 | 204 |
250
- | 1.0076 | 205 |
251
- | 1.0003 | 206 |
252
- | 1.0003 | 207 |
253
- | 0.9884 | 208 |
254
- | 0.9884 | 209 |
255
- | 0.9812 | 210 |
256
- | 0.9769 | 211 |
257
- | 0.9767 | 212 |
258
- | 0.9703 | 213 |
259
- | 0.9632 | 214 |
260
- | 0.9615 | 215 |
261
- | 0.9520 | 216 |
262
- | 0.9506 | 217 |
263
- | 0.9400 | 218 |
264
- | 0.9430 | 219 |
265
- | 0.9346 | 220 |
266
- | 0.9308 | 221 |
267
- | 0.9240 | 222 |
268
- | 0.9177 | 223 |
269
- | 0.9157 | 224 |
270
- | 0.9096 | 225 |
271
- | 0.9036 | 226 |
272
- | 0.8968 | 227 |
273
- | 0.8965 | 228 |
274
- | 0.8864 | 229 |
275
- | 0.8831 | 230 |
276
- | 0.8806 | 231 |
277
- | 0.8750 | 232 |
278
- | 0.8660 | 233 |
279
- | 0.8621 | 234 |
280
- | 0.8579 | 235 |
281
- | 0.8538 | 236 |
282
- | 0.8440 | 237 |
283
- | 0.8443 | 238 |
284
- | 0.8381 | 239 |
285
- | 0.8338 | 240 |
286
- | 0.8286 | 241 |
287
- | 0.8201 | 242 |
288
- | 0.8158 | 243 |
289
- | 0.8099 | 244 |
290
- | 0.8092 | 245 |
291
- | 0.7996 | 246 |
292
- | 0.8011 | 247 |
293
- | 0.7952 | 248 |
294
- | 0.7879 | 249 |
295
- | 0.7866 | 250 |
296
- | 0.7762 | 251 |
297
- | 0.7758 | 252 |
298
- | 0.7615 | 253 |
299
- | 0.7649 | 254 |
300
- | 0.7591 | 255 |
301
- | 0.7596 | 256 |
302
- | 0.7470 | 257 |
303
- | 0.7384 | 258 |
304
- | 0.7373 | 259 |
305
- | 0.7298 | 260 |
306
- | 0.7252 | 261 |
307
- | 0.7184 | 262 |
308
- | 0.7146 | 263 |
309
- | 0.7136 | 264 |
310
- | 0.7020 | 265 |
311
- | 0.7034 | 266 |
312
- | 0.6944 | 267 |
313
- | 0.6897 | 268 |
314
- | 0.6883 | 269 |
315
- | 0.6845 | 270 |
316
- | 0.6777 | 271 |
317
- | 0.6739 | 272 |
318
- | 0.6730 | 273 |
319
- | 0.6631 | 274 |
320
- | 0.6602 | 275 |
321
- | 0.6477 | 276 |
322
- | 0.6492 | 277 |
323
- | 0.6412 | 278 |
324
- | 0.6379 | 279 |
325
- | 0.6321 | 280 |
326
- | 0.6289 | 281 |
327
- | 0.6255 | 282 |
328
- | 0.6214 | 283 |
329
- | 0.6150 | 284 |
330
- | 0.6086 | 285 |
331
- | 0.6087 | 286 |
332
- | 0.6085 | 287 |
333
- | 0.5951 | 288 |
334
- | 0.5907 | 289 |
335
- | 0.5841 | 290 |
336
- | 0.5783 | 291 |
337
- | 0.5733 | 292 |
338
- | 0.5754 | 293 |
339
- | 0.5677 | 294 |
340
- | 0.5562 | 295 |
341
- | 0.5611 | 296 |
342
- | 0.5520 | 297 |
343
- | 0.5453 | 298 |
344
- | 0.5408 | 299 |
345
- | 0.5392 | 300 |
346
- | 0.5374 | 301 |
347
- | 0.5324 | 302 |
348
- | 0.5287 | 303 |
349
- | 0.5220 | 304 |
350
- | 0.5153 | 305 |
351
- | 0.5081 | 306 |
352
- | 0.5063 | 307 |
353
- | 0.5055 | 308 |
354
- | 0.4964 | 309 |
355
- | 0.4915 | 310 |
356
- | 0.4860 | 311 |
357
- | 0.4826 | 312 |
358
- | 0.4788 | 313 |
359
- | 0.4739 | 314 |
360
- | 0.4697 | 315 |
361
- | 0.4665 | 316 |
362
- | 0.4576 | 317 |
363
- | 0.4571 | 318 |
364
- | 0.4486 | 319 |
365
- | 0.4487 | 320 |
366
- | 0.4456 | 321 |
367
- | 0.4392 | 322 |
368
- | 0.4341 | 323 |
369
- | 0.4310 | 324 |
370
- | 0.4273 | 325 |
371
- | 0.4241 | 326 |
372
- | 0.4216 | 327 |
373
- | 0.4145 | 328 |
374
- | 0.4142 | 329 |
375
- | 0.4118 | 330 |
376
- | 0.4034 | 331 |
377
- | 0.3977 | 332 |
378
- | 0.3941 | 333 |
379
- | 0.3960 | 334 |
380
- | 0.3918 | 335 |
381
- | 0.3866 | 336 |
382
- | 0.3795 | 337 |
383
- | 0.3734 | 338 |
384
- | 0.3727 | 339 |
385
- | 0.3697 | 340 |
386
- | 0.3678 | 341 |
387
- | 0.3615 | 342 |
388
- | 0.3624 | 343 |
389
- | 0.3553 | 344 |
390
- | 0.3499 | 345 |
391
- | 0.3494 | 346 |
392
- | 0.3438 | 347 |
393
- | 0.3357 | 348 |
394
- | 0.3400 | 349 |
395
- | 0.3352 | 350 |
396
- | 0.3266 | 351 |
397
- | 0.3255 | 352 |
398
- | 0.3200 | 353 |
399
- | 0.3168 | 354 |
400
- | 0.3180 | 355 |
401
- | 0.3127 | 356 |
402
- | 0.3042 | 357 |
403
- | 0.3086 | 358 |
404
- | 0.3071 | 359 |
405
- | 0.3026 | 360 |
406
- | 0.2974 | 361 |
407
- | 0.2901 | 362 |
408
- | 0.2900 | 363 |
409
- | 0.2888 | 364 |
410
- | 0.2869 | 365 |
411
- | 0.2816 | 366 |
412
- | 0.2832 | 367 |
413
- | 0.2807 | 368 |
414
- | 0.2721 | 369 |
415
- | 0.2707 | 370 |
416
- | 0.2674 | 371 |
417
- | 0.2640 | 372 |
418
- | 0.2609 | 373 |
419
- | 0.2608 | 374 |
420
- | 0.2590 | 375 |
421
- | 0.2628 | 376 |
422
- | 0.2559 | 377 |
423
- | 0.2436 | 378 |
424
- | 0.2465 | 379 |
425
- | 0.2447 | 380 |
426
- | 0.2437 | 381 |
427
- | 0.2389 | 382 |
428
- | 0.2395 | 383 |
429
- | 0.2325 | 384 |
430
- | 0.2322 | 385 |
431
- | 0.2288 | 386 |
432
- | 0.2302 | 387 |
433
- | 0.2270 | 388 |
434
- | 0.2282 | 389 |
435
- | 0.2262 | 390 |
436
- | 0.2189 | 391 |
437
- | 0.2221 | 392 |
438
- | 0.2184 | 393 |
439
- | 0.2136 | 394 |
440
- | 0.2131 | 395 |
441
- | 0.2113 | 396 |
442
- | 0.2055 | 397 |
443
- | 0.2079 | 398 |
444
- | 0.2067 | 399 |
445
- | 0.2049 | 400 |
446
- | 0.1996 | 401 |
447
- | 0.2000 | 402 |
448
- | 0.2020 | 403 |
449
- | 0.1983 | 404 |
450
- | 0.1943 | 405 |
451
- | 0.1972 | 406 |
452
- | 0.1898 | 407 |
453
- | 0.1924 | 408 |
454
- | 0.1846 | 409 |
455
- | 0.1905 | 410 |
456
- | 0.1851 | 411 |
457
- | 0.1823 | 412 |
458
- | 0.1839 | 413 |
459
- | 0.1801 | 414 |
460
- | 0.1793 | 415 |
461
- | 0.1803 | 416 |
462
- | 0.1759 | 417 |
463
- | 0.1811 | 418 |
464
- | 0.1733 | 419 |
465
- | 0.1741 | 420 |
466
- | 0.1692 | 421 |
467
- | 0.1675 | 422 |
468
- | 0.1712 | 423 |
469
- | 0.1678 | 424 |
470
- | 0.1634 | 425 |
471
- | 0.1639 | 426 |
472
- | 0.1646 | 427 |
473
- | 0.1634 | 428 |
474
- | 0.1608 | 429 |
475
- | 0.1625 | 430 |
476
- | 0.1610 | 431 |
477
- | 0.1607 | 432 |
478
- | 0.1594 | 433 |
479
- | 0.1537 | 434 |
480
- | 0.1559 | 435 |
481
- | 0.1528 | 436 |
482
- | 0.1526 | 437 |
483
- | 0.1514 | 438 |
484
- | 0.1500 | 439 |
485
- | 0.1493 | 440 |
486
- | 0.1465 | 441 |
487
- | 0.1485 | 442 |
488
- | 0.1455 | 443 |
489
- | 0.1475 | 444 |
490
- | 0.1458 | 445 |
491
- | 0.1372 | 446 |
492
- | 0.1422 | 447 |
493
- | 0.1394 | 448 |
494
- | 0.1403 | 449 |
495
- | 0.1397 | 450 |
496
- | 0.1378 | 451 |
497
- | 0.1417 | 452 |
498
- | 0.1369 | 453 |
499
- | 0.1322 | 454 |
500
- | 0.1324 | 455 |
501
- | 0.1343 | 456 |
502
- | 0.1336 | 457 |
503
- | 0.1305 | 458 |
504
- | 0.1308 | 459 |
505
- | 0.1296 | 460 |
506
- | 0.1308 | 461 |
507
- | 0.1292 | 462 |
508
- | 0.1258 | 463 |
509
- | 0.1232 | 464 |
510
- | 0.1262 | 465 |
511
- | 0.1259 | 466 |
512
- | 0.1257 | 467 |
513
- | 0.1253 | 468 |
514
- | 0.1235 | 469 |
515
- | 0.1227 | 470 |
516
- | 0.1227 | 471 |
517
- | 0.1204 | 472 |
518
- | 0.1208 | 473 |
519
- | 0.1215 | 474 |
520
- | 0.1161 | 475 |
521
- | 0.1230 | 476 |
522
- | 0.1162 | 477 |
523
- | 0.1165 | 478 |
524
- | 0.1157 | 479 |
525
- | 0.1146 | 480 |
526
- | 0.1112 | 481 |
527
- | 0.1147 | 482 |
528
- | 0.1124 | 483 |
529
- | 0.1131 | 484 |
530
- | 0.1132 | 485 |
531
- | 0.1128 | 486 |
532
- | 0.1130 | 487 |
533
- | 0.1122 | 488 |
534
- | 0.1132 | 489 |
535
- | 0.1102 | 490 |
536
- | 0.1088 | 491 |
537
- | 0.1096 | 492 |
538
- | 0.1082 | 493 |
539
- | 0.1097 | 494 |
540
- | 0.1066 | 495 |
541
- | 0.1070 | 496 |
542
- | 0.1061 | 497 |
543
- | 0.1021 | 498 |
544
- | 0.1067 | 499 |
545
- | 0.1071 | 500 |
546
- | 0.1043 | 501 |
547
- | 0.1028 | 502 |
548
- | 0.1007 | 503 |
549
- | 0.1013 | 504 |
550
- | 0.1000 | 505 |
551
- | 0.1038 | 506 |
552
- | 0.1040 | 507 |
553
- | 0.1028 | 508 |
554
- | 0.1025 | 509 |
555
- | 0.0981 | 510 |
556
- | 0.1018 | 511 |
557
- | 0.1011 | 512 |
558
- | 0.0957 | 513 |
559
- | 0.0968 | 514 |
560
- | 0.0980 | 515 |
561
- | 0.0987 | 516 |
562
- | 0.0970 | 517 |
563
- | 0.0953 | 518 |
564
- | 0.0946 | 519 |
565
- | 0.0962 | 520 |
566
- | 0.0922 | 521 |
567
- | 0.0951 | 522 |
568
- | 0.0935 | 523 |
569
- | 0.0941 | 524 |
570
- | 0.0927 | 525 |
571
- | 0.0930 | 526 |
572
- | 0.0946 | 527 |
573
- | 0.0942 | 528 |
574
- | 0.0918 | 529 |
575
- | 0.0939 | 530 |
576
- | 0.0896 | 531 |
577
- | 0.0934 | 532 |
578
- | 0.0883 | 533 |
579
- | 0.0892 | 534 |
580
- | 0.0893 | 535 |
581
- | 0.0895 | 536 |
582
- | 0.0903 | 537 |
583
- | 0.0880 | 538 |
584
- | 0.0851 | 539 |
585
- | 0.0888 | 540 |
586
- | 0.0906 | 541 |
587
- | 0.0891 | 542 |
588
- | 0.0868 | 543 |
589
- | 0.0878 | 544 |
590
- | 0.0865 | 545 |
591
- | 0.0871 | 546 |
592
- | 0.0853 | 547 |
593
- | 0.0847 | 548 |
594
- | 0.0819 | 549 |
595
- | 0.0853 | 550 |
596
- | 0.0820 | 551 |
597
- | 0.0822 | 552 |
598
- | 0.0807 | 553 |
599
- | 0.0812 | 554 |
600
- | 0.0811 | 555 |
601
- | 0.0815 | 556 |
602
- | 0.0795 | 557 |
603
- | 0.0804 | 558 |
604
- | 0.0792 | 559 |
605
- | 0.0841 | 560 |
606
- | 0.0802 | 561 |
607
- | 0.0814 | 562 |
608
- | 0.0812 | 563 |
609
- | 0.0818 | 564 |
610
- | 0.0788 | 565 |
611
- | 0.0800 | 566 |
612
- | 0.0790 | 567 |
613
- | 0.0784 | 568 |
614
- | 0.0774 | 569 |
615
- | 0.0769 | 570 |
616
- | 0.0794 | 571 |
617
- | 0.0769 | 572 |
618
- | 0.0784 | 573 |
619
- | 0.0782 | 574 |
620
- | 0.0783 | 575 |
621
- | 0.0769 | 576 |
622
- | 0.0750 | 577 |
623
- | 0.0754 | 578 |
624
- | 0.0746 | 579 |
625
- | 0.0750 | 580 |
626
- | 0.0757 | 581 |
627
- | 0.0757 | 582 |
628
- | 0.0741 | 583 |
629
- | 0.0756 | 584 |
630
- | 0.0754 | 585 |
631
- | 0.0742 | 586 |
632
- | 0.0723 | 587 |
633
- | 0.0729 | 588 |
634
- | 0.0742 | 589 |
635
- | 0.0714 | 590 |
636
- | 0.0728 | 591 |
637
- | 0.0764 | 592 |
638
- | 0.0738 | 593 |
639
- | 0.0733 | 594 |
640
- | 0.0708 | 595 |
641
- | 0.0764 | 596 |
642
- | 0.0721 | 597 |
643
- | 0.0724 | 598 |
644
- | 0.0709 | 599 |
645
- | 0.0705 | 600 |
646
- | 0.0722 | 601 |
647
- | 0.0738 | 602 |
648
- | 0.0707 | 603 |
649
- | 0.0694 | 604 |
650
- | 0.0708 | 605 |
651
- | 0.0691 | 606 |
652
- | 0.0692 | 607 |
653
- | 0.0792 | 608 |
654
- | 0.0701 | 609 |
655
- | 0.0707 | 610 |
656
- | 0.0688 | 611 |
657
- | 0.0679 | 612 |
658
- | 0.0688 | 613 |
659
- | 0.0677 | 614 |
660
- | 0.0689 | 615 |
661
- | 0.0674 | 616 |
662
- | 0.0684 | 617 |
663
- | 0.0684 | 618 |
664
- | 0.0698 | 619 |
665
- | 0.0674 | 620 |
666
- | 0.0690 | 621 |
667
- | 0.0667 | 622 |
668
- | 0.0688 | 623 |
669
- | 0.0667 | 624 |
670
- | 0.0648 | 625 |
671
- | 0.0666 | 626 |
672
- | 0.0645 | 627 |
673
- | 0.0659 | 628 |
674
- | 0.0662 | 629 |
675
- | 0.0658 | 630 |
676
- | 0.0656 | 631 |
677
- | 0.0662 | 632 |
678
- | 0.0637 | 633 |
679
- | 0.0649 | 634 |
680
- | 0.0656 | 635 |
681
- | 0.0663 | 636 |
682
- | 0.0649 | 637 |
683
- | 0.0650 | 638 |
684
- | 0.0686 | 639 |
685
- | 0.0616 | 640 |
686
- | 0.0646 | 641 |
687
- | 0.0643 | 642 |
688
- | 0.0624 | 643 |
689
- | 0.0619 | 644 |
690
- | 0.0635 | 645 |
691
- | 0.0630 | 646 |
692
- | 0.0643 | 647 |
693
- | 0.0630 | 648 |
694
- | 0.0659 | 649 |
695
- | 0.0624 | 650 |
696
- | 0.0642 | 651 |
697
- | 0.0620 | 652 |
698
- | 0.0625 | 653 |
699
- | 0.0620 | 654 |
700
- | 0.0633 | 655 |
701
- | 0.0615 | 656 |
702
- | 0.0613 | 657 |
703
- | 0.0606 | 658 |
704
- | 0.0594 | 659 |
705
- | 0.0623 | 660 |
706
- | 0.0606 | 661 |
707
- | 0.0605 | 662 |
708
- | 0.0614 | 663 |
709
- | 0.0590 | 664 |
710
- | 0.0582 | 665 |
711
- | 0.0597 | 666 |
712
- | 0.0586 | 667 |
713
- | 0.0598 | 668 |
714
- | 0.0595 | 669 |
715
- | 0.0582 | 670 |
716
- | 0.0599 | 671 |
717
- | 0.0617 | 672 |
718
- | 0.0583 | 673 |
719
- | 0.0570 | 674 |
720
- | 0.0588 | 675 |
721
- | 0.0583 | 676 |
722
- | 0.0571 | 677 |
723
- | 0.0580 | 678 |
724
- | 0.0570 | 679 |
725
- | 0.0578 | 680 |
726
- | 0.0585 | 681 |
727
- | 0.0582 | 682 |
728
- | 0.0561 | 683 |
729
- | 0.0575 | 684 |
730
- | 0.0562 | 685 |
731
- | 0.0555 | 686 |
732
- | 0.0574 | 687 |
733
- | 0.0564 | 688 |
734
- | 0.0570 | 689 |
735
- | 0.0558 | 690 |
736
- | 0.0588 | 691 |
737
- | 0.0551 | 692 |
738
- | 0.0559 | 693 |
739
- | 0.0563 | 694 |
740
- | 0.0550 | 695 |
741
- | 0.0569 | 696 |
742
- | 0.0558 | 697 |
743
- | 0.0572 | 698 |
744
- | 0.0572 | 699 |
745
- | 0.0570 | 700 |
746
- | 0.0567 | 701 |
747
- | 0.0583 | 702 |
748
- | 0.0561 | 703 |
749
- | 0.0546 | 704 |
750
- | 0.0550 | 705 |
751
- | 0.0543 | 706 |
752
- | 0.0556 | 707 |
753
- | 0.0540 | 708 |
754
- | 0.0536 | 709 |
755
- | 0.0533 | 710 |
756
- | 0.0543 | 711 |
757
- | 0.0564 | 712 |
758
- | 0.0543 | 713 |
759
- | 0.0555 | 714 |
760
- | 0.0534 | 715 |
761
- | 0.0552 | 716 |
762
- | 0.0536 | 717 |
763
- | 0.0535 | 718 |
764
- | 0.0530 | 719 |
765
- | 0.0541 | 720 |
766
- | 0.0535 | 721 |
767
- | 0.0535 | 722 |
768
- | 0.0526 | 723 |
769
- | 0.0533 | 724 |
770
- | 0.0539 | 725 |
771
- | 0.0512 | 726 |
772
- | 0.0532 | 727 |
773
- | 0.0536 | 728 |
774
- | 0.0533 | 729 |
775
- | 0.0534 | 730 |
776
- | 0.0536 | 731 |
777
- | 0.0524 | 732 |
778
- | 0.0540 | 733 |
779
- | 0.0519 | 734 |
780
- | 0.0515 | 735 |
781
- | 0.0541 | 736 |
782
- | 0.0541 | 737 |
783
- | 0.0514 | 738 |
784
- | 0.0515 | 739 |
785
- | 0.0531 | 740 |
786
- | 0.0509 | 741 |
787
- | 0.0526 | 742 |
788
- | 0.0528 | 743 |
789
- | 0.0534 | 744 |
790
- | 0.0536 | 745 |
791
- | 0.0533 | 746 |
792
- | 0.0520 | 747 |
793
- | 0.0510 | 748 |
794
- | 0.0530 | 749 |
795
- | 0.0503 | 750 |
796
- | 0.0509 | 751 |
797
- | 0.0509 | 752 |
798
- | 0.0492 | 753 |
799
- | 0.0509 | 754 |
800
- | 0.0520 | 755 |
801
- | 0.0500 | 756 |
802
- | 0.0503 | 757 |
803
- | 0.0512 | 758 |
804
- | 0.0504 | 759 |
805
- | 0.0516 | 760 |
806
- | 0.0527 | 761 |
807
- | 0.0511 | 762 |
808
- | 0.0494 | 763 |
809
- | 0.0493 | 764 |
810
- | 0.0513 | 765 |
811
- | 0.0491 | 766 |
812
- | 0.0485 | 767 |
813
- | 0.0488 | 768 |
814
- | 0.0507 | 769 |
815
- | 0.0480 | 770 |
816
- | 0.0488 | 771 |
817
- | 0.0492 | 772 |
818
- | 0.0489 | 773 |
819
- | 0.0480 | 774 |
820
- | 0.0477 | 775 |
821
- | 0.0459 | 776 |
822
- | 0.0481 | 777 |
823
- | 0.0477 | 778 |
824
- | 0.0478 | 779 |
825
- | 0.0500 | 780 |
826
- | 0.0478 | 781 |
827
- | 0.0485 | 782 |
828
- | 0.0472 | 783 |
829
- | 0.0466 | 784 |
830
- | 0.0480 | 785 |
831
- | 0.0492 | 786 |
832
- | 0.0472 | 787 |
833
- | 0.0492 | 788 |
834
- | 0.0478 | 789 |
835
- | 0.0485 | 790 |
836
- | 0.0470 | 791 |
837
- | 0.0472 | 792 |
838
- | 0.0478 | 793 |
839
- | 0.0490 | 794 |
840
- | 0.0481 | 795 |
841
- | 0.0489 | 796 |
842
- | 0.0480 | 797 |
843
- | 0.0468 | 798 |
844
- | 0.0462 | 799 |
845
 
846
 
847
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: MohamedAAK/my_awesome_power_model_llm
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
 
13
 
14
  # my_awesome_power_model_llm
15
 
16
+ This model is a fine-tuned version of [MohamedAAK/my_awesome_power_model_llm](https://huggingface.co/MohamedAAK/my_awesome_power_model_llm) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+
 
19
 
20
  ## Model description
21
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - optimizer: None
38
  - training_precision: float32
39
 
40
  ### Training results
41
 
42
 
43
 
44
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "distilgpt2",
3
  "_num_labels": 1,
4
  "activation_function": "gelu_new",
5
  "architectures": [
 
1
  {
2
+ "_name_or_path": "MohamedAAK/my_awesome_power_model_llm",
3
  "_num_labels": 1,
4
  "activation_function": "gelu_new",
5
  "architectures": [
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9df305bcaaeb8a5b305f0ead6ade14cae89d8ce10acade6c3400362232eb30c9
3
- size 327748552
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:edb7c8d916a62d35d4da1361f3404f16998e89a4298158a7621fa9ca768cbd10
3
+ size 327748544
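
The `tf_model.h5` change above only touches a Git LFS pointer file: three `key value` lines giving the spec version, the SHA-256 object id, and the byte size. As a minimal sketch (the helper name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling), the two pointers from this commit can be parsed and compared like this:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict.

    Hypothetical helper for illustration; it assumes the simple three-line
    layout shown in the diff (version, oid, size).
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is a byte count
    return fields


# Pointer contents copied from the removed (-) and added (+) sides of the diff.
old = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:9df305bcaaeb8a5b305f0ead6ade14cae89d8ce10acade6c3400362232eb30c9
size 327748552
"""
)
new = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:edb7c8d916a62d35d4da1361f3404f16998e89a4298158a7621fa9ca768cbd10
size 327748544
"""
)

size_delta = new["size"] - old["size"]
weights_changed = old["oid"] != new["oid"]
print(f"size delta: {size_delta} bytes, content changed: {weights_changed}")
```

The differing object ids show that the stored file content changed, while the size shrank by 8 bytes. The pointer alone does not say what inside the file changed.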