Papers

235 papers found
Unequal Scientific Recognition in the Age of LLMs
Yixuan Liu, Abel Elekes, Jianglin Lu et al.
2025 EMNLP
PropXplain: Can LLMs Enable Explainable Propaganda Detection?
Maram Hasanain, Md Arid Hasan, Mohamed Bayan Kmainasi et al.
2025 EMNLP
Can We Edit LLMs for Long-Tail Biomedical Knowledge?
Xinhao Yi, Jake Lever, Kevin Bryson et al.
2025 EMNLP
Zero-Shot Belief: A Hard Problem for LLMs
John Murzaku, Owen Rambow
2025 EMNLP
Social Debiasing for Fair Multi-modal LLMs
Harry Cheng, Yangyang Guo, Qingpei Guo et al.
2025 ICCV
AgentBench: Evaluating LLMs as Agents
Xiao Liu, Hao Yu, Hanchen Zhang et al.
2024 ICLR
SmartPlay: A Benchmark for LLMs as Intelligent Agents
Yue Wu, Xuan Tang, Tom Mitchell et al.
2024 ICLR
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
Yuxin Zhang, Lirui Zhao, Mingbao Lin et al.
2024 ICLR
RouteLLM: Learning to Route LLMs from Preference Data
Isaac Ong, Amjad Almahairi, Vincent Wu et al.
2025 ICLR
PEARL: Towards Permutation-Resilient LLMs
Liang CHEN, Li Shen, Yang Deng et al.
2025 ICLR
LLMs Can Plan Only If We Tell Them
Bilgehan Sel, Ruoxi Jia, Ming Jin
2025 ICLR
Benchmarking LLMs' Judgments with No Gold Standard
Shengwei Xu, Yuxuan Lu, Grant Schoenebeck et al.
2025 ICLR
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin, Chankyu Lee, Mohammad Shoeybi et al.
2025 ICLR
Teaching LLMs How to Learn with Contextual Fine-Tuning
Younwoo Choi, Muhammad Adil Asif, Ziwen Han et al.
2025 ICLR
Active Task Disambiguation with LLMs
Kasia Kobalczyk, Nicolás Astorga, Tennison Liu et al.
2025 ICLR
Persistent Pre-training Poisoning of LLMs
Yiming Zhang, Javier Rando, Ivan Evtimov et al.
2025 ICLR
Tamper-Resistant Safeguards for Open-Weight LLMs
Rishub Tamirisa, Bhrugu Bharathi, Long Phan et al.
2025 ICLR
Certifying Counterfactual Bias in LLMs
Isha Chaudhary, Qian Hu, Manoj Kumar et al.
2025 ICLR
Self-Evolved Reward Learning for LLMs
Chenghua Huang, Zhizhen Fan, Lu Wang et al.
2025 ICLR
SysBench: Can LLMs Follow System Message?
Yanzhao Qin, Tao Zhang, Tao Zhang et al.
2025 ICLR
From Tokens to Words: On the Inner Lexicon of LLMs
Guy Kaplan, Matanel Oren, Yuval Reif et al.
2025 ICLR
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding, Jiancan Wu, Yancheng Yuan et al.
2025 ICLR
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Jihan Yao, Wenxuan Ding, Shangbin Feng et al.
2025 ICLR