yifeihu/TF-ID-base

@@ -16,18 +16,17 @@ TF-ID (Table/Figure IDentifier) is a family of object detection models finetuned
 | Model   | Model size | Model Description |
 | ------- | ------------- |   ------------- |
 | TF-ID-base[[HF]](https://huggingface.co/yifeihu/TF-ID-base) | 0.23B  | Extract tables/figures and their caption text
-| TF-ID-large[[HF]](https://huggingface.co/yifeihu/TF-ID-large) | 0.77B  | Extract tables/figures and their caption text
 | TF-ID-base-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-base-no-caption) | 0.23B  | Extract tables/figures without caption text
-| TF-ID-large-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-large-no-caption) | 0.77B  | Extract tables/figures without caption text
 All TF-ID models are finetuned from [microsoft/Florence-2](https://huggingface.co/microsoft/Florence-2-large-ft) checkpoints.
-The models were finetuned with papers from Hugging Face Daily Papers. All bounding boxes are manually annotated and checked by humans.
-TF-ID models take an image of a single paper page as the input, and return bounding boxes for all tables and figures in the given page.
-TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
-TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
 ![image/png](https://huggingface.co/yifeihu/TF-ID-base/resolve/main/td-id-caption.png)
@@ -91,7 +90,9 @@ To visualize the results, see [this tutorial notebook](https://colab.research.go
 ## Finetuning Code and Dataset
-Coming soon!
 ## BibTex and citation info

 | Model   | Model size | Model Description |
 | ------- | ------------- |   ------------- |
 | TF-ID-base[[HF]](https://huggingface.co/yifeihu/TF-ID-base) | 0.23B  | Extract tables/figures and their caption text
+| TF-ID-large[[HF]](https://huggingface.co/yifeihu/TF-ID-large) (Recommended) | 0.77B  | Extract tables/figures and their caption text
 | TF-ID-base-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-base-no-caption) | 0.23B  | Extract tables/figures without caption text
+| TF-ID-large-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-large-no-caption) (Recommended) | 0.77B  | Extract tables/figures without caption text
 All TF-ID models are finetuned from [microsoft/Florence-2](https://huggingface.co/microsoft/Florence-2-large-ft) checkpoints.
+- The models were finetuned with papers from Hugging Face Daily Papers. All bounding boxes are manually annotated and checked by humans.
+- TF-ID models take an image of a single paper page as the input, and return bounding boxes for all tables and figures in the given page.
+- TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
+- TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
+**Large models are always recommended!**
 ![image/png](https://huggingface.co/yifeihu/TF-ID-base/resolve/main/td-id-caption.png)
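
As a rough illustration of working with the bounding-box output described above, the sketch below groups detections by label and clamps them to the page. It assumes the result follows the Florence-2 object detection post-processing layout (a dict keyed by the task prompt `<OD>`, holding parallel `bboxes` and `labels` lists). The helper name `group_boxes`, the label strings `"table"` and `"figure"`, the page size, and the coordinates are all illustrative assumptions, not values taken from this model card.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels


def group_boxes(result: dict, page_size: Tuple[int, int], task: str = "<OD>") -> Dict[str, List[Box]]:
    """Group detected boxes by label, rounded to ints and clamped to the page.

    `result` is assumed to look like:
        {"<OD>": {"bboxes": [[x1, y1, x2, y2], ...], "labels": ["table", ...]}}
    """
    width, height = page_size
    detections = result.get(task, {})
    grouped: Dict[str, List[Box]] = {}
    for (x1, y1, x2, y2), label in zip(detections.get("bboxes", []), detections.get("labels", [])):
        box = (
            max(0, int(round(x1))),
            max(0, int(round(y1))),
            min(width, int(round(x2))),
            min(height, int(round(y2))),
        )
        grouped.setdefault(label, []).append(box)
    return grouped


# Made-up detections for a 612 x 792 page image (hypothetical values).
example = {
    "<OD>": {
        "bboxes": [[34.2, 50.7, 560.1, 300.4], [40.0, 420.0, 580.9, 805.3]],
        "labels": ["table", "figure"],
    }
}
print(group_boxes(example, page_size=(612, 792)))
```

Each resulting tuple can be passed to an image library's crop routine (for example `PIL.Image.Image.crop`) to extract the table or figure from the page image.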
 ## Finetuning Code and Dataset
+Dataset: [yifeihu/TF-ID-arxiv-papers](https://huggingface.co/datasets/yifeihu/TF-ID-arxiv-papers)
+Code: Coming soon!
 ## BibTex and citation info