Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2204.08387

Awesome Document AI

A collection of open-source document AI 📄 📝 📈

Running on Zero

84

🏃

UDOP
Running on Zero

39

📚

Pix2struct

Play with all the pix2struct variants in this d
Running

24

🦀

Compare Docvqa Models

Compare different visual question answering
Runtime error

290

🦉

DocQuery — Document Query Engine

LayoutLM and Document Intelligence

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2
microsoft/layoutlm-base-uncased

Updated Apr 16, 2024 • 1.88M • 47
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
microsoft/layoutlmv2-base-uncased

Updated Sep 16, 2022 • 416k • 61

Papers - Document AI

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR - Tesseract for Text Location

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - Table Structure Recognition

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
Text Role Classification in Scientific Charts Using Multimodal Transformers

Paper • 2402.14579 • Published Feb 8, 2024 • 1
An inclusive review on deep learning techniques and their scope in handwriting recognition

Paper • 2404.08011 • Published Apr 10, 2024 • 1

Papers - Image - Tabular

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Documents - Tabular

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
More efficient manual review of automatically transcribed tabular data

Paper • 2306.16126 • Published Jun 28, 2023 • 1
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2

Papers - Document - OCR

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30, 2024 • 8
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
ETC: Encoding Long and Structured Inputs in Transformers

Paper • 2004.08483 • Published Apr 17, 2020 • 1

Papers - Documents - LayoutLM

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30, 2024 • 8
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs