jialicheng commited on
Commit
df9641b
·
verified ·
1 Parent(s): 7409b00

Upload folder using huggingface_hub

Browse files
README.md ADDED
@@ -0,0 +1,161 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: google/vit-base-patch16-224-in21k
4
+ tags:
5
+ - image-classification
6
+ - vision
7
+ - generated_from_trainer
8
+ metrics:
9
+ - accuracy
10
+ model-index:
11
+ - name: vit-base
12
+ results: []
13
+ ---
14
+
15
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
+ should probably proofread and complete it, then remove this comment. -->
17
+
18
+ # vit-base
19
+
20
+ This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the cifar100 dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 0.3142
23
+ - Accuracy: 0.9197
24
+
25
+ ## Model description
26
+
27
+ More information needed
28
+
29
+ ## Intended uses & limitations
30
+
31
+ More information needed
32
+
33
+ ## Training and evaluation data
34
+
35
+ More information needed
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - learning_rate: 1e-05
43
+ - train_batch_size: 128
44
+ - eval_batch_size: 256
45
+ - seed: 42
46
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
+ - lr_scheduler_type: linear
48
+ - num_epochs: 100
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
54
+ | 4.0904 | 1.0 | 333 | 4.0142 | 0.6663 |
55
+ | 3.5092 | 2.0 | 666 | 3.3563 | 0.7659 |
56
+ | 3.0987 | 3.0 | 999 | 2.9060 | 0.8043 |
57
+ | 2.7858 | 4.0 | 1332 | 2.5428 | 0.827 |
58
+ | 2.4372 | 5.0 | 1665 | 2.2353 | 0.8428 |
59
+ | 2.2157 | 6.0 | 1998 | 1.9597 | 0.8568 |
60
+ | 1.9573 | 7.0 | 2331 | 1.7241 | 0.8685 |
61
+ | 1.841 | 8.0 | 2664 | 1.5210 | 0.8736 |
62
+ | 1.6085 | 9.0 | 2997 | 1.3363 | 0.8832 |
63
+ | 1.4188 | 10.0 | 3330 | 1.1857 | 0.8861 |
64
+ | 1.3424 | 11.0 | 3663 | 1.0522 | 0.8923 |
65
+ | 1.1924 | 12.0 | 3996 | 0.9380 | 0.8983 |
66
+ | 1.1764 | 13.0 | 4329 | 0.8405 | 0.8999 |
67
+ | 1.0548 | 14.0 | 4662 | 0.7641 | 0.9024 |
68
+ | 0.9714 | 15.0 | 4995 | 0.6897 | 0.9069 |
69
+ | 0.9141 | 16.0 | 5328 | 0.6327 | 0.9047 |
70
+ | 0.8937 | 17.0 | 5661 | 0.5862 | 0.9065 |
71
+ | 0.79 | 18.0 | 5994 | 0.5389 | 0.9104 |
72
+ | 0.6761 | 19.0 | 6327 | 0.5092 | 0.9075 |
73
+ | 0.7064 | 20.0 | 6660 | 0.4760 | 0.9162 |
74
+ | 0.7224 | 21.0 | 6993 | 0.4502 | 0.9127 |
75
+ | 0.712 | 22.0 | 7326 | 0.4289 | 0.913 |
76
+ | 0.6541 | 23.0 | 7659 | 0.4088 | 0.913 |
77
+ | 0.6338 | 24.0 | 7992 | 0.3914 | 0.9172 |
78
+ | 0.6097 | 25.0 | 8325 | 0.3776 | 0.9182 |
79
+ | 0.6369 | 26.0 | 8658 | 0.3676 | 0.9155 |
80
+ | 0.6007 | 27.0 | 8991 | 0.3632 | 0.9149 |
81
+ | 0.6059 | 28.0 | 9324 | 0.3552 | 0.9187 |
82
+ | 0.5227 | 29.0 | 9657 | 0.3454 | 0.9178 |
83
+ | 0.6712 | 30.0 | 9990 | 0.3375 | 0.9183 |
84
+ | 0.5053 | 31.0 | 10323 | 0.3355 | 0.9171 |
85
+ | 0.5432 | 32.0 | 10656 | 0.3328 | 0.917 |
86
+ | 0.4617 | 33.0 | 10989 | 0.3295 | 0.9191 |
87
+ | 0.4784 | 34.0 | 11322 | 0.3250 | 0.918 |
88
+ | 0.5088 | 35.0 | 11655 | 0.3188 | 0.9195 |
89
+ | 0.5121 | 36.0 | 11988 | 0.3188 | 0.9172 |
90
+ | 0.4734 | 37.0 | 12321 | 0.3174 | 0.9193 |
91
+ | 0.5554 | 38.0 | 12654 | 0.3108 | 0.9196 |
92
+ | 0.4573 | 39.0 | 12987 | 0.3111 | 0.9203 |
93
+ | 0.4692 | 40.0 | 13320 | 0.3074 | 0.9203 |
94
+ | 0.481 | 41.0 | 13653 | 0.3042 | 0.922 |
95
+ | 0.4888 | 42.0 | 13986 | 0.3058 | 0.921 |
96
+ | 0.4032 | 43.0 | 14319 | 0.3025 | 0.9211 |
97
+ | 0.4731 | 44.0 | 14652 | 0.3063 | 0.9202 |
98
+ | 0.4574 | 45.0 | 14985 | 0.3052 | 0.92 |
99
+ | 0.3993 | 46.0 | 15318 | 0.3098 | 0.9215 |
100
+ | 0.4631 | 47.0 | 15651 | 0.3078 | 0.9201 |
101
+ | 0.409 | 48.0 | 15984 | 0.3056 | 0.9197 |
102
+ | 0.4584 | 49.0 | 16317 | 0.3060 | 0.9208 |
103
+ | 0.3853 | 50.0 | 16650 | 0.3061 | 0.9208 |
104
+ | 0.3836 | 51.0 | 16983 | 0.3072 | 0.9216 |
105
+ | 0.3969 | 52.0 | 17316 | 0.3070 | 0.9197 |
106
+ | 0.453 | 53.0 | 17649 | 0.3060 | 0.9188 |
107
+ | 0.3802 | 54.0 | 17982 | 0.3046 | 0.9204 |
108
+ | 0.4191 | 55.0 | 18315 | 0.3075 | 0.9208 |
109
+ | 0.4245 | 56.0 | 18648 | 0.3018 | 0.9205 |
110
+ | 0.4356 | 57.0 | 18981 | 0.3033 | 0.9214 |
111
+ | 0.348 | 58.0 | 19314 | 0.3081 | 0.9208 |
112
+ | 0.4232 | 59.0 | 19647 | 0.3058 | 0.9198 |
113
+ | 0.3363 | 60.0 | 19980 | 0.3066 | 0.9195 |
114
+ | 0.3537 | 61.0 | 20313 | 0.3067 | 0.9197 |
115
+ | 0.3613 | 62.0 | 20646 | 0.3065 | 0.9192 |
116
+ | 0.4121 | 63.0 | 20979 | 0.3086 | 0.9211 |
117
+ | 0.3939 | 64.0 | 21312 | 0.3095 | 0.9207 |
118
+ | 0.3616 | 65.0 | 21645 | 0.3061 | 0.9215 |
119
+ | 0.3645 | 66.0 | 21978 | 0.3085 | 0.9197 |
120
+ | 0.42 | 67.0 | 22311 | 0.3088 | 0.9191 |
121
+ | 0.3862 | 68.0 | 22644 | 0.3083 | 0.9193 |
122
+ | 0.3519 | 69.0 | 22977 | 0.3103 | 0.9187 |
123
+ | 0.4464 | 70.0 | 23310 | 0.3111 | 0.9192 |
124
+ | 0.3852 | 71.0 | 23643 | 0.3116 | 0.919 |
125
+ | 0.3406 | 72.0 | 23976 | 0.3082 | 0.9194 |
126
+ | 0.3785 | 73.0 | 24309 | 0.3071 | 0.9191 |
127
+ | 0.3559 | 74.0 | 24642 | 0.3101 | 0.9194 |
128
+ | 0.3298 | 75.0 | 24975 | 0.3099 | 0.9187 |
129
+ | 0.3596 | 76.0 | 25308 | 0.3099 | 0.9208 |
130
+ | 0.3419 | 77.0 | 25641 | 0.3120 | 0.9201 |
131
+ | 0.3918 | 78.0 | 25974 | 0.3077 | 0.9201 |
132
+ | 0.3571 | 79.0 | 26307 | 0.3119 | 0.9195 |
133
+ | 0.3609 | 80.0 | 26640 | 0.3120 | 0.9195 |
134
+ | 0.3324 | 81.0 | 26973 | 0.3120 | 0.9194 |
135
+ | 0.3387 | 82.0 | 27306 | 0.3118 | 0.9199 |
136
+ | 0.441 | 83.0 | 27639 | 0.3117 | 0.92 |
137
+ | 0.359 | 84.0 | 27972 | 0.3132 | 0.9195 |
138
+ | 0.3106 | 85.0 | 28305 | 0.3131 | 0.9204 |
139
+ | 0.3191 | 86.0 | 28638 | 0.3130 | 0.9201 |
140
+ | 0.3987 | 87.0 | 28971 | 0.3141 | 0.9202 |
141
+ | 0.3327 | 88.0 | 29304 | 0.3138 | 0.9194 |
142
+ | 0.3464 | 89.0 | 29637 | 0.3142 | 0.9207 |
143
+ | 0.3634 | 90.0 | 29970 | 0.3145 | 0.9207 |
144
+ | 0.3123 | 91.0 | 30303 | 0.3133 | 0.9197 |
145
+ | 0.3029 | 92.0 | 30636 | 0.3138 | 0.92 |
146
+ | 0.3814 | 93.0 | 30969 | 0.3124 | 0.9192 |
147
+ | 0.2953 | 94.0 | 31302 | 0.3126 | 0.9203 |
148
+ | 0.3475 | 95.0 | 31635 | 0.3141 | 0.9206 |
149
+ | 0.3406 | 96.0 | 31968 | 0.3141 | 0.9197 |
150
+ | 0.3448 | 97.0 | 32301 | 0.3141 | 0.9198 |
151
+ | 0.3687 | 98.0 | 32634 | 0.3137 | 0.9205 |
152
+ | 0.345 | 99.0 | 32967 | 0.3144 | 0.92 |
153
+ | 0.3582 | 100.0 | 33300 | 0.3142 | 0.9197 |
154
+
155
+
156
+ ### Framework versions
157
+
158
+ - Transformers 4.39.3
159
+ - Pytorch 2.2.2+cu118
160
+ - Datasets 2.18.0
161
+ - Tokenizers 0.15.2
all_results.json ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "dr_accuracy": 0.9849176470588236,
3
+ "dr_loss": 0.08670716732740402,
4
+ "dr_runtime": 105.5564,
5
+ "dr_samples_per_second": 402.628,
6
+ "dr_steps_per_second": 1.582,
7
+ "epoch": 100.0,
8
+ "test_accuracy": 0.922,
9
+ "test_loss": 0.3042304813861847,
10
+ "test_runtime": 26.4191,
11
+ "test_samples_per_second": 378.514,
12
+ "test_steps_per_second": 1.514,
13
+ "train_loss": 0.7083145791847069,
14
+ "train_runtime": 34418.8887,
15
+ "train_samples_per_second": 123.479,
16
+ "train_steps_per_second": 0.967
17
+ }
checkpoint-13653/config.json ADDED
@@ -0,0 +1,229 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "google/vit-base-patch16-224-in21k",
3
+ "architectures": [
4
+ "ViTForImageClassification"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "encoder_stride": 16,
8
+ "finetuning_task": "image-classification",
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.0,
11
+ "hidden_size": 768,
12
+ "id2label": {
13
+ "0": "apple",
14
+ "1": "aquarium_fish",
15
+ "10": "bowl",
16
+ "11": "boy",
17
+ "12": "bridge",
18
+ "13": "bus",
19
+ "14": "butterfly",
20
+ "15": "camel",
21
+ "16": "can",
22
+ "17": "castle",
23
+ "18": "caterpillar",
24
+ "19": "cattle",
25
+ "2": "baby",
26
+ "20": "chair",
27
+ "21": "chimpanzee",
28
+ "22": "clock",
29
+ "23": "cloud",
30
+ "24": "cockroach",
31
+ "25": "couch",
32
+ "26": "cra",
33
+ "27": "crocodile",
34
+ "28": "cup",
35
+ "29": "dinosaur",
36
+ "3": "bear",
37
+ "30": "dolphin",
38
+ "31": "elephant",
39
+ "32": "flatfish",
40
+ "33": "forest",
41
+ "34": "fox",
42
+ "35": "girl",
43
+ "36": "hamster",
44
+ "37": "house",
45
+ "38": "kangaroo",
46
+ "39": "keyboard",
47
+ "4": "beaver",
48
+ "40": "lamp",
49
+ "41": "lawn_mower",
50
+ "42": "leopard",
51
+ "43": "lion",
52
+ "44": "lizard",
53
+ "45": "lobster",
54
+ "46": "man",
55
+ "47": "maple_tree",
56
+ "48": "motorcycle",
57
+ "49": "mountain",
58
+ "5": "bed",
59
+ "50": "mouse",
60
+ "51": "mushroom",
61
+ "52": "oak_tree",
62
+ "53": "orange",
63
+ "54": "orchid",
64
+ "55": "otter",
65
+ "56": "palm_tree",
66
+ "57": "pear",
67
+ "58": "pickup_truck",
68
+ "59": "pine_tree",
69
+ "6": "bee",
70
+ "60": "plain",
71
+ "61": "plate",
72
+ "62": "poppy",
73
+ "63": "porcupine",
74
+ "64": "possum",
75
+ "65": "rabbit",
76
+ "66": "raccoon",
77
+ "67": "ray",
78
+ "68": "road",
79
+ "69": "rocket",
80
+ "7": "beetle",
81
+ "70": "rose",
82
+ "71": "sea",
83
+ "72": "seal",
84
+ "73": "shark",
85
+ "74": "shrew",
86
+ "75": "skunk",
87
+ "76": "skyscraper",
88
+ "77": "snail",
89
+ "78": "snake",
90
+ "79": "spider",
91
+ "8": "bicycle",
92
+ "80": "squirrel",
93
+ "81": "streetcar",
94
+ "82": "sunflower",
95
+ "83": "sweet_pepper",
96
+ "84": "table",
97
+ "85": "tank",
98
+ "86": "telephone",
99
+ "87": "television",
100
+ "88": "tiger",
101
+ "89": "tractor",
102
+ "9": "bottle",
103
+ "90": "train",
104
+ "91": "trout",
105
+ "92": "tulip",
106
+ "93": "turtle",
107
+ "94": "wardrobe",
108
+ "95": "whale",
109
+ "96": "willow_tree",
110
+ "97": "wolf",
111
+ "98": "woman",
112
+ "99": "worm"
113
+ },
114
+ "image_size": 224,
115
+ "initializer_range": 0.02,
116
+ "intermediate_size": 3072,
117
+ "label2id": {
118
+ "apple": "0",
119
+ "aquarium_fish": "1",
120
+ "baby": "2",
121
+ "bear": "3",
122
+ "beaver": "4",
123
+ "bed": "5",
124
+ "bee": "6",
125
+ "beetle": "7",
126
+ "bicycle": "8",
127
+ "bottle": "9",
128
+ "bowl": "10",
129
+ "boy": "11",
130
+ "bridge": "12",
131
+ "bus": "13",
132
+ "butterfly": "14",
133
+ "camel": "15",
134
+ "can": "16",
135
+ "castle": "17",
136
+ "caterpillar": "18",
137
+ "cattle": "19",
138
+ "chair": "20",
139
+ "chimpanzee": "21",
140
+ "clock": "22",
141
+ "cloud": "23",
142
+ "cockroach": "24",
143
+ "couch": "25",
144
+ "cra": "26",
145
+ "crocodile": "27",
146
+ "cup": "28",
147
+ "dinosaur": "29",
148
+ "dolphin": "30",
149
+ "elephant": "31",
150
+ "flatfish": "32",
151
+ "forest": "33",
152
+ "fox": "34",
153
+ "girl": "35",
154
+ "hamster": "36",
155
+ "house": "37",
156
+ "kangaroo": "38",
157
+ "keyboard": "39",
158
+ "lamp": "40",
159
+ "lawn_mower": "41",
160
+ "leopard": "42",
161
+ "lion": "43",
162
+ "lizard": "44",
163
+ "lobster": "45",
164
+ "man": "46",
165
+ "maple_tree": "47",
166
+ "motorcycle": "48",
167
+ "mountain": "49",
168
+ "mouse": "50",
169
+ "mushroom": "51",
170
+ "oak_tree": "52",
171
+ "orange": "53",
172
+ "orchid": "54",
173
+ "otter": "55",
174
+ "palm_tree": "56",
175
+ "pear": "57",
176
+ "pickup_truck": "58",
177
+ "pine_tree": "59",
178
+ "plain": "60",
179
+ "plate": "61",
180
+ "poppy": "62",
181
+ "porcupine": "63",
182
+ "possum": "64",
183
+ "rabbit": "65",
184
+ "raccoon": "66",
185
+ "ray": "67",
186
+ "road": "68",
187
+ "rocket": "69",
188
+ "rose": "70",
189
+ "sea": "71",
190
+ "seal": "72",
191
+ "shark": "73",
192
+ "shrew": "74",
193
+ "skunk": "75",
194
+ "skyscraper": "76",
195
+ "snail": "77",
196
+ "snake": "78",
197
+ "spider": "79",
198
+ "squirrel": "80",
199
+ "streetcar": "81",
200
+ "sunflower": "82",
201
+ "sweet_pepper": "83",
202
+ "table": "84",
203
+ "tank": "85",
204
+ "telephone": "86",
205
+ "television": "87",
206
+ "tiger": "88",
207
+ "tractor": "89",
208
+ "train": "90",
209
+ "trout": "91",
210
+ "tulip": "92",
211
+ "turtle": "93",
212
+ "wardrobe": "94",
213
+ "whale": "95",
214
+ "willow_tree": "96",
215
+ "wolf": "97",
216
+ "woman": "98",
217
+ "worm": "99"
218
+ },
219
+ "layer_norm_eps": 1e-12,
220
+ "model_type": "vit",
221
+ "num_attention_heads": 12,
222
+ "num_channels": 3,
223
+ "num_hidden_layers": 12,
224
+ "patch_size": 16,
225
+ "problem_type": "single_label_classification",
226
+ "qkv_bias": true,
227
+ "torch_dtype": "float32",
228
+ "transformers_version": "4.39.3"
229
+ }
checkpoint-13653/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0257506fa76ffb27f5407a6d29e3352ba67ea82f776c3e7dc924e78015e648f2
3
+ size 343525432
checkpoint-13653/optimizer.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6ece461ede8b011cf59ae7f09365f0a60d206e38eebb60ef80de60ebd742e19
3
+ size 687171770
checkpoint-13653/preprocessor_config.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_valid_processor_keys": [
3
+ "images",
4
+ "do_resize",
5
+ "size",
6
+ "resample",
7
+ "do_rescale",
8
+ "rescale_factor",
9
+ "do_normalize",
10
+ "image_mean",
11
+ "image_std",
12
+ "return_tensors",
13
+ "data_format",
14
+ "input_data_format"
15
+ ],
16
+ "do_normalize": true,
17
+ "do_rescale": true,
18
+ "do_resize": true,
19
+ "image_mean": [
20
+ 0.5,
21
+ 0.5,
22
+ 0.5
23
+ ],
24
+ "image_processor_type": "ViTImageProcessor",
25
+ "image_std": [
26
+ 0.5,
27
+ 0.5,
28
+ 0.5
29
+ ],
30
+ "resample": 2,
31
+ "rescale_factor": 0.00392156862745098,
32
+ "size": {
33
+ "height": 224,
34
+ "width": 224
35
+ }
36
+ }
checkpoint-13653/rng_state.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b92f274572141e2c41187256d38a9c3edbd1d660e1747a46c3763772628cf99d
3
+ size 14244
checkpoint-13653/scheduler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f148555bdf5ce3511e1eab6b754645b5da72cdb328e866cca089f2f70f88f829
3
+ size 1064
checkpoint-13653/trainer_state.json ADDED
The diff for this file is too large to render. See raw diff
 
checkpoint-13653/training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e0dcb6d1e5d2e73f17bf0d7f49e82edb04972a9e3077a93cef0dc412132025b7
3
+ size 4920
config.json ADDED
@@ -0,0 +1,229 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "google/vit-base-patch16-224-in21k",
3
+ "architectures": [
4
+ "ViTForImageClassification"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "encoder_stride": 16,
8
+ "finetuning_task": "image-classification",
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.0,
11
+ "hidden_size": 768,
12
+ "id2label": {
13
+ "0": "apple",
14
+ "1": "aquarium_fish",
15
+ "10": "bowl",
16
+ "11": "boy",
17
+ "12": "bridge",
18
+ "13": "bus",
19
+ "14": "butterfly",
20
+ "15": "camel",
21
+ "16": "can",
22
+ "17": "castle",
23
+ "18": "caterpillar",
24
+ "19": "cattle",
25
+ "2": "baby",
26
+ "20": "chair",
27
+ "21": "chimpanzee",
28
+ "22": "clock",
29
+ "23": "cloud",
30
+ "24": "cockroach",
31
+ "25": "couch",
32
+ "26": "cra",
33
+ "27": "crocodile",
34
+ "28": "cup",
35
+ "29": "dinosaur",
36
+ "3": "bear",
37
+ "30": "dolphin",
38
+ "31": "elephant",
39
+ "32": "flatfish",
40
+ "33": "forest",
41
+ "34": "fox",
42
+ "35": "girl",
43
+ "36": "hamster",
44
+ "37": "house",
45
+ "38": "kangaroo",
46
+ "39": "keyboard",
47
+ "4": "beaver",
48
+ "40": "lamp",
49
+ "41": "lawn_mower",
50
+ "42": "leopard",
51
+ "43": "lion",
52
+ "44": "lizard",
53
+ "45": "lobster",
54
+ "46": "man",
55
+ "47": "maple_tree",
56
+ "48": "motorcycle",
57
+ "49": "mountain",
58
+ "5": "bed",
59
+ "50": "mouse",
60
+ "51": "mushroom",
61
+ "52": "oak_tree",
62
+ "53": "orange",
63
+ "54": "orchid",
64
+ "55": "otter",
65
+ "56": "palm_tree",
66
+ "57": "pear",
67
+ "58": "pickup_truck",
68
+ "59": "pine_tree",
69
+ "6": "bee",
70
+ "60": "plain",
71
+ "61": "plate",
72
+ "62": "poppy",
73
+ "63": "porcupine",
74
+ "64": "possum",
75
+ "65": "rabbit",
76
+ "66": "raccoon",
77
+ "67": "ray",
78
+ "68": "road",
79
+ "69": "rocket",
80
+ "7": "beetle",
81
+ "70": "rose",
82
+ "71": "sea",
83
+ "72": "seal",
84
+ "73": "shark",
85
+ "74": "shrew",
86
+ "75": "skunk",
87
+ "76": "skyscraper",
88
+ "77": "snail",
89
+ "78": "snake",
90
+ "79": "spider",
91
+ "8": "bicycle",
92
+ "80": "squirrel",
93
+ "81": "streetcar",
94
+ "82": "sunflower",
95
+ "83": "sweet_pepper",
96
+ "84": "table",
97
+ "85": "tank",
98
+ "86": "telephone",
99
+ "87": "television",
100
+ "88": "tiger",
101
+ "89": "tractor",
102
+ "9": "bottle",
103
+ "90": "train",
104
+ "91": "trout",
105
+ "92": "tulip",
106
+ "93": "turtle",
107
+ "94": "wardrobe",
108
+ "95": "whale",
109
+ "96": "willow_tree",
110
+ "97": "wolf",
111
+ "98": "woman",
112
+ "99": "worm"
113
+ },
114
+ "image_size": 224,
115
+ "initializer_range": 0.02,
116
+ "intermediate_size": 3072,
117
+ "label2id": {
118
+ "apple": "0",
119
+ "aquarium_fish": "1",
120
+ "baby": "2",
121
+ "bear": "3",
122
+ "beaver": "4",
123
+ "bed": "5",
124
+ "bee": "6",
125
+ "beetle": "7",
126
+ "bicycle": "8",
127
+ "bottle": "9",
128
+ "bowl": "10",
129
+ "boy": "11",
130
+ "bridge": "12",
131
+ "bus": "13",
132
+ "butterfly": "14",
133
+ "camel": "15",
134
+ "can": "16",
135
+ "castle": "17",
136
+ "caterpillar": "18",
137
+ "cattle": "19",
138
+ "chair": "20",
139
+ "chimpanzee": "21",
140
+ "clock": "22",
141
+ "cloud": "23",
142
+ "cockroach": "24",
143
+ "couch": "25",
144
+ "cra": "26",
145
+ "crocodile": "27",
146
+ "cup": "28",
147
+ "dinosaur": "29",
148
+ "dolphin": "30",
149
+ "elephant": "31",
150
+ "flatfish": "32",
151
+ "forest": "33",
152
+ "fox": "34",
153
+ "girl": "35",
154
+ "hamster": "36",
155
+ "house": "37",
156
+ "kangaroo": "38",
157
+ "keyboard": "39",
158
+ "lamp": "40",
159
+ "lawn_mower": "41",
160
+ "leopard": "42",
161
+ "lion": "43",
162
+ "lizard": "44",
163
+ "lobster": "45",
164
+ "man": "46",
165
+ "maple_tree": "47",
166
+ "motorcycle": "48",
167
+ "mountain": "49",
168
+ "mouse": "50",
169
+ "mushroom": "51",
170
+ "oak_tree": "52",
171
+ "orange": "53",
172
+ "orchid": "54",
173
+ "otter": "55",
174
+ "palm_tree": "56",
175
+ "pear": "57",
176
+ "pickup_truck": "58",
177
+ "pine_tree": "59",
178
+ "plain": "60",
179
+ "plate": "61",
180
+ "poppy": "62",
181
+ "porcupine": "63",
182
+ "possum": "64",
183
+ "rabbit": "65",
184
+ "raccoon": "66",
185
+ "ray": "67",
186
+ "road": "68",
187
+ "rocket": "69",
188
+ "rose": "70",
189
+ "sea": "71",
190
+ "seal": "72",
191
+ "shark": "73",
192
+ "shrew": "74",
193
+ "skunk": "75",
194
+ "skyscraper": "76",
195
+ "snail": "77",
196
+ "snake": "78",
197
+ "spider": "79",
198
+ "squirrel": "80",
199
+ "streetcar": "81",
200
+ "sunflower": "82",
201
+ "sweet_pepper": "83",
202
+ "table": "84",
203
+ "tank": "85",
204
+ "telephone": "86",
205
+ "television": "87",
206
+ "tiger": "88",
207
+ "tractor": "89",
208
+ "train": "90",
209
+ "trout": "91",
210
+ "tulip": "92",
211
+ "turtle": "93",
212
+ "wardrobe": "94",
213
+ "whale": "95",
214
+ "willow_tree": "96",
215
+ "wolf": "97",
216
+ "woman": "98",
217
+ "worm": "99"
218
+ },
219
+ "layer_norm_eps": 1e-12,
220
+ "model_type": "vit",
221
+ "num_attention_heads": 12,
222
+ "num_channels": 3,
223
+ "num_hidden_layers": 12,
224
+ "patch_size": 16,
225
+ "problem_type": "single_label_classification",
226
+ "qkv_bias": true,
227
+ "torch_dtype": "float32",
228
+ "transformers_version": "4.39.3"
229
+ }
dr_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "dr_accuracy": 0.9849176470588236,
3
+ "dr_loss": 0.08670716732740402,
4
+ "dr_runtime": 105.5564,
5
+ "dr_samples_per_second": 402.628,
6
+ "dr_steps_per_second": 1.582,
7
+ "epoch": 100.0
8
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0257506fa76ffb27f5407a6d29e3352ba67ea82f776c3e7dc924e78015e648f2
3
+ size 343525432
preprocessor_config.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_valid_processor_keys": [
3
+ "images",
4
+ "do_resize",
5
+ "size",
6
+ "resample",
7
+ "do_rescale",
8
+ "rescale_factor",
9
+ "do_normalize",
10
+ "image_mean",
11
+ "image_std",
12
+ "return_tensors",
13
+ "data_format",
14
+ "input_data_format"
15
+ ],
16
+ "do_normalize": true,
17
+ "do_rescale": true,
18
+ "do_resize": true,
19
+ "image_mean": [
20
+ 0.5,
21
+ 0.5,
22
+ 0.5
23
+ ],
24
+ "image_processor_type": "ViTImageProcessor",
25
+ "image_std": [
26
+ 0.5,
27
+ 0.5,
28
+ 0.5
29
+ ],
30
+ "resample": 2,
31
+ "rescale_factor": 0.00392156862745098,
32
+ "size": {
33
+ "height": 224,
34
+ "width": 224
35
+ }
36
+ }
test_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 100.0,
3
+ "test_accuracy": 0.922,
4
+ "test_loss": 0.3042304813861847,
5
+ "test_runtime": 26.4191,
6
+ "test_samples_per_second": 378.514,
7
+ "test_steps_per_second": 1.514
8
+ }
train_results.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 100.0,
3
+ "train_loss": 0.7083145791847069,
4
+ "train_runtime": 34418.8887,
5
+ "train_samples_per_second": 123.479,
6
+ "train_steps_per_second": 0.967
7
+ }
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff
 
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e0dcb6d1e5d2e73f17bf0d7f49e82edb04972a9e3077a93cef0dc412132025b7
3
+ size 4920