Cost-effective End-to-end Information Extraction for Semi-structured Document Images

Wonseok Hwang; Hyunji Lee; Jinyeong Yim; Geewook Kim; Minjoon Seo

2021 EMNLP EMNLP 2021

Cost-effective End-to-end Information Extraction for Semi-structured Document Images

Abstract

AbstractA real-world information extraction (IE) system for semi-structured document images often involves a long pipeline of multiple modules, whose complexity dramatically increases its development and maintenance cost. One can instead consider an end-to-end model that directly maps the input to the target output and simplify the entire process. However, such generation approach is known to lead to unstable performance if not designed carefully. Here we present our recent effort on transitioning from our existing pipeline-based IE system to an end-to-end system focusing on practical challenges that are associated with replacing and deploying the system in real, large-scale production. By carefully formulating document IE as a sequence generation task, we show that a single end-to-end IE system can be built and still achieve competent performance.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Wonseok Hwang , Hyunji Lee , Jinyeong Yim , Geewook Kim , Minjoon Seo

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Efficient Computing Natural Language Processing > Applications > Information Extraction Computer Vision > Domain-Specific > Document Analysis Deep Learning > Learning Types > Deep Learning Deep Learning > Learning Types > Transfer Learning

Keywords

sequence generation information extraction document understanding end-to-end learning semi-structured document end-to-end model document image

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021