← Domain-Specific

Computer Vision › Domain-Specific ›

Document Analysis

278 directly classified papers

Papers per year

Papers

Improving Scene Text Image Super-resolution via Dual Prior Modulation Network AAAI 2023

Exploring Stroke-Level Modifications for Scene Text Editing AAAI 2023

Beyond Layout Embedding: Layout Attention with Gaussian Biases for Structured Document Understanding EMNLP 2023

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ICCV 2023

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding ACL 2023

QueryForm: A Simple Zero-shot Form Entity Query Framework ACL 2023

M2C: Towards Automatic Multimodal Manga Complement EMNLP 2023

LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder EMNLP 2023

Evaluating Out-of-Distribution Performance on Document Image Classifiers NIPS 2022

DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents COLING 2022

Automatic ICD Coding Exploiting Discourse Structure and Reconciled Code Embeddings COLING 2022

Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis COLING 2022

GMN: Generative Multi-modal Network for Practical Document Information Extraction NAACL 2022

Language-Independent Approach for Morphological Disambiguation COLING 2022

C3-STISR: Scene Text Image Super-resolution with Triple Clues IJCAI 2022

Category-Specific Nuance Exploration Network for Fine-Grained Object Retrieval AAAI 2022

TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition AAAI 2022

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction ACL 2022

MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding ACL 2022

OCR Improves Machine Translation for Low-Resource Languages ACL 2022

XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding ACL 2022

Long Text and Multi-Table Summarization: Dataset and Method EMNLP 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding EMNLP 2022

A Benchmark and Dataset for Post-OCR text correction in Sanskrit EMNLP 2022

Towards End-to-End Unified Scene Text Detection and Layout Analysis CVPR 2022