Papers
17,973 papers found
AGENTVIGIL: Automatic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents
Zhun Wang, Vincent Siu, Zhe Ye et al.
Agent vs. Agent: Automated Data Generation and Red-Teaming for Custom Agentic Workflows
Ninad Kulkarni, Xian Wu, Siddharth Varia et al.
A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users
Nishant Balepur, Matthew Shu, Yoo Yeon Sung et al.
A Graph-Theoretical Framework for Analyzing the Behavior of Causal Language Models
Rashin Rahnamoun, Mehrnoush Shamsfard
A Group Fairness Lens for Large Language Models
Guanqun Bi, Yuqiang Xie, Lei Shen et al.
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs
Artem Shelmanov, Ekaterina Fadeeva, Akim Tsvigun et al.
AI Argues Differently: Distinct Argumentative and Linguistic Patterns of LLMs in Persuasive Contexts
Esra Dönmez, Maximilian Maurer, Gabriella Lapesa et al.
AI Chatbots as Professional Service Agents: Developing a Professional Identity
Wenwen Li, Kangwei Shi, Yidong Chai
AI Knowledge Assist: An Automated Approach for the Creation of Knowledge Bases for Conversational AI Agents
Md Tahmid Rahman Laskar, Julien Bouvier Tremblay, Xue-Yong Fu et al.
AI Knows Where You Are: Exposure, Bias, and Inference in Multimodal Geolocation with KoreaGEO
Xiaonan Wang, Bo Shao, Hansaem Kim
AIMMerging: Adaptive Iterative Model Merging Using Training Trajectories for Language Model Continual Learning
Yujie Feng, Jian Li, Xiaoyu Dong et al.
AIPOM: Agent-aware Interactive Planning for Multi-Agent Systems
Hannah Kim, Kushan Mitra, Chen Shen et al.
AIP: Subverting Retrieval-Augmented Generation via Adversarial Instructional Prompt
Saket Sanjeev Chaturvedi, Gaurav Bagwe, Lan Emily Zhang et al.
AIR: Complex Instruction Generation via Automatic Iterative Refinement
Wei Liu, Yancheng He, Yu Li et al.
AIRepr: An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science
Qiuhai Zeng, Claire Jin, Xinyue Wang et al.
AirRAG: Autonomous Strategic Planning and Reasoning Steer Retrieval Augmented Generation
Wenfeng Feng, Chuzhan Hao, Yuewei Zhang et al.
AI Sees Your Location—But With A Bias Toward The Wealthy World
Jingyuan Huang, Jen-tse Huang, Ziyi Liu et al.
AkibaNLP-TUT: Injecting Language-Specific Word-Level Noise for Low-Resource Language Translation
Shoki Hamada, Tomoyosi Akiba, Hajime Tsukada
A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
Alex Duchnowski, Ellie Pavlick, Alexander Koller
A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making
Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng et al.
ALARB: An Arabic Legal Argument Reasoning Benchmark
Harethah Abu Shairah, Somayah AlHarbi, Abdulaziz AlHussein et al.
Alif: Advancing Urdu Large Language Models via Multilingual Synthetic Data Distillation
Muhammad Ali Shafique, Kanwal Mehreen, Muhammad Arham et al.
Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA
Qingyun Jin, Xiaohui Song, Feng Zhou et al.
Aligning Black-Box LLMs for Aspect Sentiment Quad Prediction
Shichen Li, Jiawei Zhang, Zhongqing Wang et al.
Aligning Dialogue Agents with Global Feedback via Large Language Model Multimodal Reward Decomposition
Dong Won Lee, Hae Won Park, Cynthia Breazeal et al.