Papers
18,748 papers found
Fairness Perceptions of Large Language Models
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
The Silent Amplifier: In-Context Examples Fuel Bias in Large Language Models
Xinwei Guo, Jiashi Gao, Junlei Zhou et al.
Activation Manipulation Attack: Penetrating and Harmful Jailbreak Attack Against Large Vision-Language Models
Haojie Hao, Jiakai Wang, Aishan Liu et al.
Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models
Kunhao Li, Wenhao Li, Di Wu et al.
The Other Mind: How Language Models Exhibit Human Temporal Cognition
Lingyu Li, Yang Yao, Yixu Wang et al.
SAVER: Mitigating Hallucinations in Large Vision-Language Models via Style-Aware Visual Early Revision
Zhaoxu Li, Chenqi Kong, Yi Yu et al.
GeoShield: Safeguarding Geolocation Privacy from Vision-Language Models via Adversarial Perturbations
Xinwei Liu, Xiaojun Jia, Yuan Xun et al.
SPAN: Benchmarking and Improving Cross-Calendar Temporal Reasoning of Large Language Models
Zhongjian Miao, Hao Fu, Chen Wei
Probing Semantic Insensitivity for Inference-Time Backdoor Defense in Multimodal Large Language Model
Xuankun Rong, Wenke Huang, Wenzheng Jiang et al.
ConfGuard: A Simple and Effective Backdoor Detection for Large Language Models
Zihan Wang, Rui Zhang, Hongwei Li et al.
A Content-Preserving Secure Linguistic Steganography
Lingyun Xiang, Chengfu Ou, Xu He et al.
Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content?
Naen Xu, Jinghuai Zhang, Changjiang Li et al.
SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge
Adeel Yousaf, Joseph Fioresi, James Beetham et al.
Two Constraint Compilation Methods for Lifted Planning
Periklis Mantenoglou, Luigi Bonassi, Enrico Scala et al.
First-Order Representation Languages for Goal-Conditioned RL
Simon Ståhlberg, Hector Geffner
CoT-VLNBench: A Benchmark for Visual Chain-of-Thought Reasoning in Vision-Language-Navigation Robots
Xiao Zhao, Chang Liu, Ruiteng Ji et al.
Instance Dependent Testing of Samplers Using Interval Conditioning
Rishiraj Bhattacharyya, Sourav Chakraborty, Yash Pote et al.
Coarse-to-Fine Open-Set Graph Node Classification with Large Language Models
Xueqi Ma, Xingjun Ma, Sarah Monazam Erfani et al.
AURA: Affordance-Understanding and Risk-aware Alignment Technique for Large Language Models
Sayantan Adak, Pratyush Chatterjee, Somnath Banerjee et al.
ALPHA: Action-Based Learning for Pluralistic Human Alignment in Large Language Models
Aanisha Bhattacharyya, Susmit Agrawal, Yaman Kumar Singla et al.
MegaCoin: Enhancing Medium-Grained Color Perception for Vision-Language Models
Ming-Chang Chiu, Shicheng Wen, Pin-Yu Chen et al.
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Alexander Duffy, Samuel J Paech, Ishana Shastri et al.
EssayBench: Evaluating Large Language Models in Multi-Genre Chinese Essay Writing
Fan Gao, Dongyuan Li, Ding Xia et al.
Align to Structure: Aligning Large Language Models with Structural Information
Zae Myung Kim, Anand Ramachandran, Farideh Tavazoee et al.
StyleBreak: Revealing Alignment Vulnerabilities in Large Audio-Language Models via Style-Aware Audio Jailbreak
Hongyi Li, Chengxuan Zhou, Chu Wang et al.