Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Computer Science
›
Applications
›
Document Analysis
632 directly classified papers
Papers per year
2002: 1
2012: 2
2013: 8
2014: 4
2015: 5
2016: 19
2017: 23
2018: 36
2019: 73
2020: 62
2021: 70
2022: 68
2023: 71
2024: 105
2025: 82
2026: 3
Papers
Armenian AutoEpiDoc: Automated Extraction and Encoding of Armenian Inscriptions into EpiDoc TEI/XML
EACL 2026
OCRTurk: A Comprehensive OCR Benchmark for Turkish
EACL 2026
DocWaveDiff: A Predict-and-Refine approach for Document Image Enhancement with Wavelet U-Nets and Diffusion models
WACV 2026
MSA2: Multi-task Framework with Structure-aware and Style-adaptive Character Representation for Open-set Chinese Text Recognition
ICCV 2025
BuDDIE: A Business Document Dataset for Multi-task Information Extraction
COLING 2025
ForCenNet: Foreground-Centric Network for Document Image Rectification
ICCV 2025
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
NAACL 2025
CLERC: A Dataset for U. S. Legal Case Retrieval and Retrieval-Augmented Analysis Generation
NAACL 2025
Py-Elotl: A Python NLP package for the languages of Mexico
NAACL 2025
Exploring Multimodal Language Models for Sustainability Disclosure Extraction: A Comparative Study
NAACL 2025
Filling the Temporal Void: Recovering Missing Publication Years in the Project Gutenberg Corpus Using LLMs
ACL 2025
Structured Information Extraction from Nepali Scanned Documents using Layout Transformer and LLMs
COLING 2025
Sign2Vis: Automated Data Visualization from Sign Language
ACL 2025
BlueNeg: A 35mm Negative Film Dataset for Restoring Channel-Heterogeneous Deterioration
ICCV 2025
Audit-FT at the Regulations Challenge Task: An Open-Source Large Language Model for Audit
COLING 2025
Developing an Informal-Formal Persian Corpus: Highlighting the Differences between Two Writing Styles
COLING 2025
From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models
NAACL 2025
NOTA: Multimodal Music Notation Understanding for Visual Large Language Model
NAACL 2025
TableCoder: Table Extraction from Text via Reliable Code Generation
ACL 2025
LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
COLING 2025
FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding
COLING 2025
Optimizing the Arrangement of Citations in Related Work Section
IJCNLP 2025
Roles of MLLMs in Visually Rich Document Retrieval for RAG: A Survey
IJCNLP 2025
Chain-of-Query: Unleashing the Power of LLMs in SQL-Aided Table Understanding via Multi-Agent Collaboration
IJCNLP 2025
Bilingual BSARD: Extending Statutory Article Retrieval to Dutch
COLING 2025
<
1
2
3
4
5
…
26
>