Hierarchical Catalogue Generation for Literature Review: A Benchmark

Kun Zhu; Xiaocheng Feng; Xiachong Feng; Yingsheng Wu; Bing Qin

2023 EMNLP EMNLP 2023

Hierarchical Catalogue Generation for Literature Review: A Benchmark

Abstract

AbstractScientific literature review generation aims to extract and organize important information from an abundant collection of reference papers and produces corresponding reviews while lacking a clear and logical hierarchy. We observe that a high-quality catalogue-guided generation process can effectively alleviate this problem. Therefore, we present an atomic and challenging task named Hierarchical Catalogue Generation for Literature Review as the first step for review generation, which aims to produce a hierarchical catalogue of a review paper given various references. We construct a novel English Hierarchical Catalogues of Literature Reviews Dataset with 7.6k literature review catalogues and 389k reference papers. To accurately assess the model performance, we design two evaluation metrics for informativeness and similarity to ground truth from semantics and structure. Our extensive analyses verify the high quality of our dataset and the effectiveness of our evaluation metrics. We further benchmark diverse experiments on state-of-the-art summarization models like BART and large language models like ChatGPT to evaluate their capabilities. We further discuss potential directions for this task to motivate future research.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Science and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — hierarchical catalogue

🐣 Hot Topic Early Bird — document analysis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kun Zhu , Xiaocheng Feng , Xiachong Feng , Yingsheng Wu , Bing Qin

Topics

Natural Language Processing > Generation > Summarization Natural Language Processing > Resources & Methods > Large Language Models Computer Science > Applications > Document Analysis Deep Learning > Learning Types > Generative Models Artificial Intelligence > Core AI > Natural Language Generation

Keywords

document summarization text summarization deep learning document analysis hierarchical generation large language model literature review hierarchical catalogue hierarchical text generation

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023