Update README.md
Browse files
README.md
CHANGED
@@ -16,18 +16,17 @@ TF-ID (Table/Figure IDentifier) is a family of object detection models finetuned
|
|
16 |
| Model | Model size | Model Description |
|
17 |
| ------- | ------------- | ------------- |
|
18 |
| TF-ID-base[[HF]](https://huggingface.co/yifeihu/TF-ID-base) | 0.23B | Extract tables/figures and their caption text
|
19 |
-
| TF-ID-large[[HF]](https://huggingface.co/yifeihu/TF-ID-large) | 0.77B | Extract tables/figures and their caption text
|
20 |
| TF-ID-base-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-base-no-caption) | 0.23B | Extract tables/figures without caption text
|
21 |
-
| TF-ID-large-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-large-no-caption) | 0.77B | Extract tables/figures without caption text
|
22 |
All TF-ID models are finetuned from [microsoft/Florence-2](https://huggingface.co/microsoft/Florence-2-large-ft) checkpoints.
|
23 |
|
24 |
-
The models were finetuned with papers from Hugging Face Daily Papers. All bounding boxes are manually annotated and checked by humans.
|
|
|
|
|
|
|
25 |
|
26 |
-
|
27 |
-
|
28 |
-
TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
|
29 |
-
|
30 |
-
TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
|
31 |
|
32 |
![image/png](https://huggingface.co/yifeihu/TF-ID-base/resolve/main/td-id-caption.png)
|
33 |
|
@@ -91,7 +90,9 @@ To visualize the results, see [this tutorial notebook](https://colab.research.go
|
|
91 |
|
92 |
## Finetuning Code and Dataset
|
93 |
|
94 |
-
|
|
|
|
|
95 |
|
96 |
## BibTex and citation info
|
97 |
|
|
|
16 |
| Model | Model size | Model Description |
|
17 |
| ------- | ------------- | ------------- |
|
18 |
| TF-ID-base[[HF]](https://huggingface.co/yifeihu/TF-ID-base) | 0.23B | Extract tables/figures and their caption text
|
19 |
+
| TF-ID-large[[HF]](https://huggingface.co/yifeihu/TF-ID-large) (Recommended) | 0.77B | Extract tables/figures and their caption text
|
20 |
| TF-ID-base-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-base-no-caption) | 0.23B | Extract tables/figures without caption text
|
21 |
+
| TF-ID-large-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-large-no-caption) (Recommended) | 0.77B | Extract tables/figures without caption text
|
22 |
All TF-ID models are finetuned from [microsoft/Florence-2](https://huggingface.co/microsoft/Florence-2-large-ft) checkpoints.
|
23 |
|
24 |
+
- The models were finetuned with papers from Hugging Face Daily Papers. All bounding boxes are manually annotated and checked by humans.
|
25 |
+
- TF-ID models take an image of a single paper page as the input, and return bounding boxes for all tables and figures in the given page.
|
26 |
+
- TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
|
27 |
+
- TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
|
28 |
|
29 |
+
**Large models are always recommended!**
|
|
|
|
|
|
|
|
|
30 |
|
31 |
![image/png](https://huggingface.co/yifeihu/TF-ID-base/resolve/main/td-id-caption.png)
|
32 |
|
|
|
90 |
|
91 |
## Finetuning Code and Dataset
|
92 |
|
93 |
+
Dataset: [yifeihu/TF-ID-arxiv-papers](https://huggingface.co/datasets/yifeihu/TF-ID-arxiv-papers)
|
94 |
+
|
95 |
+
Code: Coming soon!
|
96 |
|
97 |
## BibTex and citation info
|
98 |
|