fahadqazi
/

testts1234

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [fahadqazi/testts1234](https://huggingface.co/fahadqazi/testts1234) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3086
 ## Model description
@@ -34,39 +34,34 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: constant
-- training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 0.3542        | 0.3401 | 50   | 0.3087          |
-| 0.3558        | 0.6803 | 100  | 0.3088          |
-| 0.3571        | 1.0204 | 150  | 0.3085          |
-| 0.3578        | 1.3605 | 200  | 0.3090          |
-| 0.3608        | 1.7007 | 250  | 0.3091          |
-| 0.3508        | 2.0408 | 300  | 0.3090          |
-| 0.3551        | 2.3810 | 350  | 0.3088          |
-| 0.3553        | 2.7211 | 400  | 0.3096          |
-| 0.3572        | 3.0612 | 450  | 0.3090          |
-| 0.3517        | 3.4014 | 500  | 0.3096          |
-| 0.3633        | 3.7415 | 550  | 0.3094          |
-| 0.3612        | 4.0816 | 600  | 0.3093          |
-| 0.3655        | 4.4218 | 650  | 0.3091          |
-| 0.3619        | 4.7619 | 700  | 0.3090          |
-| 0.3601        | 5.1020 | 750  | 0.3090          |
-| 0.3557        | 5.4422 | 800  | 0.3092          |
-| 0.3533        | 5.7823 | 850  | 0.3094          |
-| 0.3531        | 6.1224 | 900  | 0.3091          |
-| 0.3597        | 6.4626 | 950  | 0.3100          |
-| 0.3559        | 6.8027 | 1000 | 0.3086          |
 ### Framework versions

 This model is a fine-tuned version of [fahadqazi/testts1234](https://huggingface.co/fahadqazi/testts1234) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3092
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- training_steps: 1500
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| 0.3565        | 0.6803  | 100  | 0.3094          |
+| 0.3532        | 1.3605  | 200  | 0.3091          |
+| 0.3528        | 2.0408  | 300  | 0.3084          |
+| 0.3619        | 2.7211  | 400  | 0.3092          |
+| 0.3574        | 3.4014  | 500  | 0.3088          |
+| 0.3584        | 4.0816  | 600  | 0.3090          |
+| 0.3526        | 4.7619  | 700  | 0.3090          |
+| 0.356         | 5.4422  | 800  | 0.3087          |
+| 0.3528        | 6.1224  | 900  | 0.3085          |
+| 0.353         | 6.8027  | 1000 | 0.3091          |
+| 0.3606        | 7.4830  | 1100 | 0.3087          |
+| 0.3531        | 8.1633  | 1200 | 0.3088          |
+| 0.3581        | 8.8435  | 1300 | 0.3088          |
+| 0.3503        | 9.5238  | 1400 | 0.3086          |
+| 0.3458        | 10.2041 | 1500 | 0.3092          |
 ### Framework versions