Papers
Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding
Youze Wang, Zijun Chen, Ruoyu Chen et al.
Safe Multi-agent Reinforcement Learning with Natural Language Constraints
Ziyan Wang, Meng Fang, Tristan Tomilin et al.
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks
Zonglin Wu, Yule Xue, Yaoyao Feng et al.
DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt
Yitong Zhang, Jia Li, Liyi Cai et al.
On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks
Ting Bi, Chenghang Ye, Zheyu Yang et al.
SatSolarCast: A Flexible Framework for Multimodal Solar Irradiance Forecasting via Memory-Alignment Learning
Kuai Dai, Hui Su, Chengxing Zhai et al.
Fine-Grained Interpretation of Political Opinions in Large Language Models
Jingyu Hu, Mengyue Yang, Mengnan Du et al.
Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation
Sofia Jamil, Kotla Sai Charan, Sriparna Saha et al.
A Human-Centric Pipeline for Aligning Large Language Models with Chinese Medical Ethics
Haoan Jin, Han Ying, Jiacheng Ji et al.
TRACE: Textual Relevance Augmentation and Contextual Encoding for Multimodal Hate Detection
Girish A. Koushik, Helen Treharne, Aditya Joshi et al.
MHB: Medical Hallucination Benchmark for Large Language Models in Complex Clinical Tasks
Jianrong Lu, Junwei Liu, Xingyun Zheng et al.
CCD-Bench: Probing Cultural Conflict in Large Language Model Decision-Making
Hasibur Rahman, Hanan Salam
PlantTraitNet: An Uncertainty-Aware Multimodal Framework for Global-Scale Plant Trait Inference from Citizen Science Data
Ayushi Sharma, Johanna Trost, Daniel Lusk et al.
OIDA-QA: A Multimodal Benchmark for Analyzing the Opioid Industry Documents Archive
Xuan Shen, Brian Wingenroth, Zichao Wang et al.
Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances
Rishu Kumar Singh, Navneet Shreya, Sarmistha Das et al.
EgoEMS: A High-Fidelity Multimodal Egocentric Dataset for Cognitive Assistance in Emergency Medical Services
Keshara Weerasinghe, Xueren Ge, Tessa Heick et al.
Explainable Oracle Bone Script Recognition via Multimodal Pictographic Reasoning
Yin Wu, Zhengxuan Zhang, Jiayu Chen et al.
Investigating Social Bias Propagation in Federated Fine-tuning of Large Language Models
Jiaxu Zhao, Meng Fang, Mingze Zhong et al.
Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf, Raphael Feraud
Towards Trustworthy Multimodal AI Systems
Chirag Agarwal
Multimodal Super-Resolution: Discovering Hidden Physics and Its Application to Fusion Plasmas (Abstract Reprint)
Azarakhsh Jalalvand, SangKyeun Kim, Jaemin Seo et al.