Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Domain-Specific
Computer Vision
›
Domain-Specific
›
Document Analysis
278 directly classified papers
Papers per year
2005: 1
2007: 1
2009: 1
2011: 1
2013: 2
2014: 1
2015: 1
2016: 1
2017: 3
2018: 7
2019: 10
2020: 19
2021: 16
2022: 31
2023: 44
2024: 43
2025: 94
2026: 2
Papers
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
AAAI 2023
Exploring Stroke-Level Modifications for Scene Text Editing
AAAI 2023
Beyond Layout Embedding: Layout Attention with Gaussian Biases for Structured Document Understanding
EMNLP 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
ICCV 2023
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding
ACL 2023
QueryForm: A Simple Zero-shot Form Entity Query Framework
ACL 2023
M2C: Towards Automatic Multimodal Manga Complement
EMNLP 2023
LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder
EMNLP 2023
Evaluating Out-of-Distribution Performance on Document Image Classifiers
NIPS 2022
DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents
COLING 2022
Automatic ICD Coding Exploiting Discourse Structure and Reconciled Code Embeddings
COLING 2022
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis
COLING 2022
GMN: Generative Multi-modal Network for Practical Document Information Extraction
NAACL 2022
Language-Independent Approach for Morphological Disambiguation
COLING 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
IJCAI 2022
Category-Specific Nuance Exploration Network for Fine-Grained Object Retrieval
AAAI 2022
TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition
AAAI 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
ACL 2022
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding
ACL 2022
OCR Improves Machine Translation for Low-Resource Languages
ACL 2022
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding
ACL 2022
Long Text and Multi-Table Summarization: Dataset and Method
EMNLP 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
EMNLP 2022
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
EMNLP 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
CVPR 2022
<
1
…
7
8
9
…
12
>