matlok
's Collections
Papers - Document - OCR
updated
Noise-Aware Training of Layout-Aware Language Models
Paper
•
2404.00488
•
Published
•
8
FormNet: Structural Encoding beyond Sequential Modeling in Form Document
Information Extraction
Paper
•
2203.08411
•
Published
•
1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document
Information Extraction
Paper
•
2305.02549
•
Published
•
6
ETC: Encoding Long and Structured Inputs in Transformers
Paper
•
2004.08483
•
Published
•
1
CascadeTabNet: An approach for end to end table detection and structure
recognition from image-based documents
Paper
•
2004.12629
•
Published
•
2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image
Masking
Paper
•
2204.08387
•
Published
•
2
Elephants Never Forget: Memorization and Learning of Tabular Data in
Large Language Models
Paper
•
2404.06209
•
Published
•
4
Text Role Classification in Scientific Charts Using Multimodal
Transformers
Paper
•
2402.14579
•
Published
•
1
An inclusive review on deep learning techniques and their scope in
handwriting recognition
Paper
•
2404.08011
•
Published
•
1
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for
Document Enhancement
Paper
•
2404.05669
•
Published
•
1