adamchanadam
commited on
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -20,10 +20,10 @@ This model is fine-tuned from [microsoft/Florence-2-base-ft](https://huggingface
|
|
20 |
- Fine-tuned for answering questions about images, specifically focused on logo recognition and company information.
|
21 |
- The model uses the `<DocVQA>` prompt to indicate the task type.
|
22 |
- Number of unique images: 28
|
23 |
-
- Number of epochs:
|
24 |
-
- Learning rate:
|
25 |
- Optimizer: AdamW
|
26 |
-
- Early stopping: Patience of
|
27 |
|
28 |
Dataset statistics: Total number of questions for fine-tuning: 560.
|
29 |
logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 65 (11.61%) text_in_logo: 57 (10.18%) industry_classification: 49 (8.75%) product_service: 55 (9.82%) company_details: 89 (15.89%) negative_sample: 148 (26.43%)
|
@@ -38,7 +38,7 @@ logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 6
|
|
38 |
- Images were resized and normalized according to Florence-2's preprocessing standards.
|
39 |
- The `<DocVQA>` prompt was used during fine-tuning to indicate the task type.
|
40 |
- Questions and answers were provided for each image in the training set.
|
41 |
-
- Batch size:
|
42 |
- Evaluation metric: Cross-entropy loss on a held-out validation set
|
43 |
|
44 |
For more information, please contact the model creators.
|
|
|
20 |
- Fine-tuned for answering questions about images, specifically focused on logo recognition and company information.
|
21 |
- The model uses the `<DocVQA>` prompt to indicate the task type.
|
22 |
- Number of unique images: 28
|
23 |
+
- Number of epochs: 7
|
24 |
+
- Learning rate: 1e-06
|
25 |
- Optimizer: AdamW
|
26 |
+
- Early stopping: Patience of 2 epochs, delta of 0.0001
|
27 |
|
28 |
Dataset statistics: Total number of questions for fine-tuning: 560.
|
29 |
logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 65 (11.61%) text_in_logo: 57 (10.18%) industry_classification: 49 (8.75%) product_service: 55 (9.82%) company_details: 89 (15.89%) negative_sample: 148 (26.43%)
|
|
|
38 |
- Images were resized and normalized according to Florence-2's preprocessing standards.
|
39 |
- The `<DocVQA>` prompt was used during fine-tuning to indicate the task type.
|
40 |
- Questions and answers were provided for each image in the training set.
|
41 |
+
- Batch size: 4
|
42 |
- Evaluation metric: Cross-entropy loss on a held-out validation set
|
43 |
|
44 |
For more information, please contact the model creators.
|