Papers
One Thousand and One Pairs: A “novel” challenge for long-context language models
Marzena Karpinska, Katherine Thai, Kyle Lo et al.
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk
Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu et al.
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Qingfei Zhao, Ruobing Wang, Yukuo Cen et al.
TAIL: A Toolkit for Automatic and Realistic Long-Context Large Language Model Evaluation
Gefei Gu, Yilun Zhao, Ruoxi Ning et al.
Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach
Zhuowan Li, Cheng Li, Mingyang Zhang et al.
Systematic Evaluation of Long-Context LLMs on Financial Concepts
Lavanya Gupta, Saket Sharma, Yiyun Zhao
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang, Yanzhao Zhang, Dingkun Long et al.
LongGenBench: Long-context Generation Benchmark
Xiang Liu, Peijie Dong, Xuming Hu et al.
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
Zexuan Qiu, Jingjing Li, Shijue Huang et al.
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Zhongwei Wan, Ziang Wu, Che Liu et al.
Insights into LLM Long-Context Failures: When Transformers Know but Don’t Tell
Muhan Gao, TaiMing Lu, Kuai Yu et al.
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Shilong Li, Yancheng He, Hangyu Guo et al.
Evaluating and Training Long-Context Large Language Models for Question Answering on Scientific Papers
Lukas Hilgert, Danni Liu, Jan Niehues
Evaluating Multilingual Long-Context Models for Retrieval and Reasoning
Ameeta Agrawal, Andy Dang, Sina Bagheri Nezhad et al.
SWAN: An Efficient and Scalable Approach for Long-Context Language Modeling
Krishna C Puvvada, Faisal Ladhak, Santiago Akle Serano et al.
Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation
Jun-Yu Ma, Tianqing Fang, Zhisong Zhang et al.
From General Reward to Targeted Reward: Improving Open-ended Long-context Generation Models
Zhihan Guo, Jiele Wu, Wenqian Cui et al.
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
Yingfa Chen, Yutong Wu, Chenyang Song et al.
Does quantization affect models’ performance on long-context tasks?
Anmol Mekala, Anirudh Atmakuru, Yixiao Song et al.
DocAgent: An Agentic Framework for Multi-Modal Long-Context Document Understanding
Li Sun, Liu He, Shuyue Jia et al.
ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models
Jiani Guo, Zuchao Li, Jie Wu et al.
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
Zihan Liao, Jun Wang, Hang Yu et al.
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
Wei Wu, Zhuoshi Pan, Kun Fu et al.
Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
Wuwei Zhang, Fangcong Yin, Howard Yen et al.
ProLongVid: A Simple but Strong Baseline for Long-context Video Instruction Tuning
Rui Wang, Bohao Li, Xiyang Dai et al.