view post Post 2424 https://huggingface.co/organizations/nerdyface/share/xvWxWxYmYpCLqZlvNJEZbJHFsDITAicJAT
Document Models (Pretrained) Various pretrained models for analyzing documents. These need to be fine-tuned for a task naver-clova-ix/donut-base Image-to-Text • Updated Aug 13, 2022 • 48.9k • 182 google/pix2struct-base Image-to-Text • Updated Dec 24, 2023 • 5.32k • 66 google/pix2struct-large Image-to-Text • Updated Sep 6, 2023 • 60.4k • 34 microsoft/layoutlmv3-base Updated Apr 10, 2024 • 2.31M • 349
Document Models (Fine-tuned) naver-clova-ix/donut-base-finetuned-cord-v2 Image-to-Text • Updated Aug 13, 2022 • 18.1k • 89 google/pix2struct-docvqa-base Visual Question Answering • Updated Dec 24, 2023 • 8.14k • 37 google/pix2struct-docvqa-large Visual Question Answering • Updated May 19, 2023 • 279 • 31 google/pix2struct-screen2words-base Visual Question Answering • Updated May 19, 2023 • 42 • 22