adamchanadam commited on
Commit
553ef06
·
verified ·
1 Parent(s): 4cd33f3

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -20,10 +20,10 @@ This model is fine-tuned from [microsoft/Florence-2-base-ft](https://huggingface
20
  - Fine-tuned for answering questions about images, specifically focused on logo recognition and company information.
21
  - The model uses the `<DocVQA>` prompt to indicate the task type.
22
  - Number of unique images: 28
23
- - Number of epochs: 15
24
- - Learning rate: 5e-06
25
  - Optimizer: AdamW
26
- - Early stopping: Patience of 3 epochs, delta of 0.01
27
 
28
  Dataset statistics: Total number of questions for fine-tuning: 560.
29
  logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 65 (11.61%) text_in_logo: 57 (10.18%) industry_classification: 49 (8.75%) product_service: 55 (9.82%) company_details: 89 (15.89%) negative_sample: 148 (26.43%)
@@ -38,7 +38,7 @@ logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 6
38
  - Images were resized and normalized according to Florence-2's preprocessing standards.
39
  - The `<DocVQA>` prompt was used during fine-tuning to indicate the task type.
40
  - Questions and answers were provided for each image in the training set.
41
- - Batch size: 2
42
  - Evaluation metric: Cross-entropy loss on a held-out validation set
43
 
44
  For more information, please contact the model creators.
 
20
  - Fine-tuned for answering questions about images, specifically focused on logo recognition and company information.
21
  - The model uses the `<DocVQA>` prompt to indicate the task type.
22
  - Number of unique images: 28
23
+ - Number of epochs: 7
24
+ - Learning rate: 1e-06
25
  - Optimizer: AdamW
26
+ - Early stopping: Patience of 2 epochs, delta of 0.0001
27
 
28
  Dataset statistics: Total number of questions for fine-tuning: 560.
29
  logo_recognition: 49 (8.75%) brand_identification: 48 (8.57%) visual_elements: 65 (11.61%) text_in_logo: 57 (10.18%) industry_classification: 49 (8.75%) product_service: 55 (9.82%) company_details: 89 (15.89%) negative_sample: 148 (26.43%)
 
38
  - Images were resized and normalized according to Florence-2's preprocessing standards.
39
  - The `<DocVQA>` prompt was used during fine-tuning to indicate the task type.
40
  - Questions and answers were provided for each image in the training set.
41
+ - Batch size: 4
42
  - Evaluation metric: Cross-entropy loss on a held-out validation set
43
 
44
  For more information, please contact the model creators.