Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Computer Science
›
Applications
›
Document Analysis
632 directly classified papers
Papers per year
2002: 1
2012: 2
2013: 8
2014: 4
2015: 5
2016: 19
2017: 23
2018: 36
2019: 73
2020: 62
2021: 70
2022: 68
2023: 71
2024: 105
2025: 82
2026: 3
Papers
NOTA: Multimodal Music Notation Understanding for Visual Large Language Model
NAACL 2025
CLERC: A Dataset for U. S. Legal Case Retrieval and Retrieval-Augmented Analysis Generation
NAACL 2025
Bringing Suzhou Numerals into the Digital Age: A Dataset and Recognition Study on Ancient Chinese Trade Records
NAACL 2025
Py-Elotl: A Python NLP package for the languages of Mexico
NAACL 2025
Exploring Multimodal Language Models for Sustainability Disclosure Extraction: A Comparative Study
NAACL 2025
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
NAACL 2025
Nayana OCR: A Scalable Framework for Document OCR in Low-Resource Languages
NAACL 2025
Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method
ICCV 2025
CourtNav: Voice-Guided, Anchor-Accurate Navigation of Long Legal Documents in Courtrooms
EMNLP 2025
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers
ACL 2025
The Million Authors Corpus: A Cross-Lingual and Cross-Domain Wikipedia Dataset for Authorship Verification
ACL 2025
BlueNeg: A 35mm Negative Film Dataset for Restoring Channel-Heterogeneous Deterioration
ICCV 2025
ComicScene154: A Scene Dataset for Comic Analysis
EMNLP 2025
M-LongDoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework
EMNLP 2025
Data Gatherer: LLM-Powered Dataset Reference Extraction from Scientific Literature
ACL 2025
NarratEX Dataset: Explaining the Dominant Narratives in News Texts
EMNLP 2025
Infogen: Generating Complex Statistical Infographics from Documents
ACL 2025
Doc2Chart: Intent-Driven Zero-Shot Chart Generation from Documents
EMNLP 2025
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management
EMNLP 2025
SEAGraph: Unveiling the Whole Story of Paper Review Comments
IJCNLP 2025
TST: A Schema-Based Top-Down and Dynamic-Aware Agent of Text-to-Table Tasks
ACL 2025
ExpLay: A new Corpus Resource for the Research on Expertise as an Influential Factor on Language Production
ACL 2025
READoc: A Unified Benchmark for Realistic Document Structured Extraction
ACL 2025
BOOKCOREF: Coreference Resolution at Book Scale
ACL 2025
Page Stream Segmentation with LLMs: Challenges and Applications in Insurance Document Automation
COLING 2025
<
1
2
3
4
5
…
26
>