Papers
235 papers found
The Impact of Inference Acceleration on Bias of LLMs
Elisabeth Kirsten, Ivan Habernal, Vedant Nanda et al.
LLMs as Meta-Reviewers’ Assistants: A Case Study
Eftekhar Hossain, Sanjeev Kumar Sinha, Naman Bansal et al.
Reverse Thinking Makes LLMs Stronger Reasoners
Justin Chen, Zifeng Wang, Hamid Palangi et al.
Social Intelligence in the Age of LLMs
Hao Zhu, Bodhisattwa Prasad Majumder, Dirk Hovy et al.
AutoClean: LLMs Can Prepare Their Training Corpus
Xingyu Shen, Shengding Hu, Xinrong Zhang et al.
CausalGraph2LLM: Evaluating LLMs for Causal Queries
Ivaxi Sheth, Bahare Fatemi, Mario Fritz
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Kuo-Han Hung, Ching-Yun Ko, Ambrish Rawat et al.
DHP Benchmark: Are LLMs Good NLG Evaluators?
Yicheng Wang, Jiayi Yuan, Yu-Neng Chuang et al.
Using LLMs to Advance Idiom Corpus Construction
Doğukan Arslan, Hüseyin Anıl Çakmak, Gülşen Eryiğit et al.
Multi-lingual Multi-turn Automated Red Teaming for LLMs
Abhishek Singhania, Christophe Dupuy, Shivam Sadashiv Mangale et al.
How Good Are LLMs at Processing Tool Outputs?
Kiran Kate, Yara Rizk, Poulami Ghosh et al.
H3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Selim Furkan Tekin, Fatih Ilhan, Sihao Hu et al.
Redefining Retrieval Evaluation in the Era of LLMs
Giovanni Trappolini, Florin Cuconasu, Simone Filice et al.
LLMs Know More About Numbers than They Can Say
Fengting Yuchi, Li Du, Jason Eisner
Are Multimodal LLMs Movie Buffs?
Carlo Bretti, Pascal Mettes, Nanne Van Noord
ExpressivityBench: Can LLMs Communicate Implicitly?
Joshua Tint, Som Sagar, Aditya Taparia et al.
Can LLMs Translate Italy’s Language Varieties?
Edoardo Signoroni, Pavel Rychlý
Linguistics to LLMs: Teaching with and about Chatbots
Ulrike Pado, Barbara Pampel
MemeBQ:Memory Efficient Binary Quantization of LLMs
Yuanhui Wang, Kunlong Liu, Minnan Pei et al.
Can Editing LLMs Inject Harm?
Canyu Chen, Baixiang Huang, Zekun Li et al.
RLKD: Distilling LLMs’ Reasoning via Reinforcement Learning
Shicheng Xu, Liang Pang, Yunchang Zhu et al.
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs
Oluwanifemi Bamgbose, Masoud Hashemi, Sathwik Tejaswi Madhusudhan et al.
Silenced Biases: The Dark Side LLMs Learned to Refuse
Rom Himelstein, Amit LeVi, Brit Youngmann et al.
Can LLMs Identify Tax Abuse?
Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme
Native Speech Processing with LLMs
Aaron Soh