Papers
10,699 papers found
Towards Multimodal Question Answering in Educational Domain
Himanshu Wadhwa, T Karthikeyan, Mausam et al.
Commentary Generation from Multimodal Game Data for Esports Moments in Multiplayer Strategy Games
Zihan Wang, Naoki Yoshinaga
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization
Rohit Saxena, Pasquale Minervini, Frank Keller
Mixed Signals: Understanding Model Disagreement in Multimodal Empathy Detection
Maya Srikanth, Run Chen, Julia Hirschberg
Can a Unimodal Language Agent Provide Preferences to Tune a Multimodal Vision-Language Model?
Sazia Tabasum Mim, Jack Morris, Manish Dhakal et al.
mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models
Arka Mukherjee, Shreya Ghosh
Zero-Shot Multi-Label Classification of Bangla Documents: Large Decoders Vs. Classic Encoders
Souvika Sarkar, Md Najib Hasan, Santu Karmaker
Benchmarking Large Language Models on Bangla Dialect Translation and Dialectal Sentiment Analysis
Md Mahir Jawad, Rafid Ahmed, Ishita Sur Apan et al.
Mātṛkā: Multilingual Jailbreak Evaluation of Open-Source Large Language Models
Murali Emani, Kashyap Manjusha R
Task-Aware Evaluation and Error-Overlap Analysis for Large Language Models
Pranava Madhyastha
Language Confusion and Multilingual Performance: A Case Study of Thai-Adapted Large Language Models
Pakhapoom Sarapat, Trapoom Ukarapol, Tatsunori Hashimoto
Test Set Quality in Multilingual LLM Evaluation
Chalamalasetti Kranti, Gabriel Bernier-Colborne, Yvan Gauthier et al.
Grahak-Nyay: Consumer Grievance Redressal through Large Language Models
Shrey Ganatra, Swapnil Bhattacharyya, Harshvivek Kashid et al.
MOD-KG: MultiOrgan Diagnosis Knowledge Graph
Anas Anwarul Haq Khan, Pushpak Bhattacharyya
QCNN-MFND: A Novel Quantum CNN Framework for Multimodal Fake News Detection in Social Media
Arya Suneesh, Balasubramanian Palani
Human-Centered Disability Bias Detection in Large Language Models
Habiba Chakour, Fatiha Sadat
Automated Telescope-Paper Linkage via Multi-Model Ensemble Learning
Ojaswa Ojaswa Varshney, Prashasti Vyas, Priyanka Goyal et al.
Enhanced Table Structure Recognition with Multi-Modal Approach
Huichen Yang, Andrew D. Hellicar, Maciej Rybinski et al.
Findings of WAT2025 English-to-Indic Multimodal Translation Task
Shantipriya Parida, Ondřej Bojar
Does Vision Still Help? Multimodal Translation with CLIP-Based Image Selection
Deepak Kumar, Baban Gain, Kshetrimayum Boynao Singh et al.
A Picture is Worth a Thousand (Correct) Captions: A Vision-Guided Judge-Corrector System for Multimodal Machine Translation
Siddharth Betala, Kushan Raj, Vipul Betala et al.
Augmenting Sign Language Translation Datasets with Large Language Models
Pedro Alejandro Dal Bianco, Jean Paul Nunes Reinhold, Facundo Manuel Quiroga et al.
Multilingual Sign Language Translation with Unified Datasets and Pose-Based Transformers
Pedro Alejandro Dal Bianco, Oscar Agustín Stanchi, Facundo Manuel Quiroga et al.
7 Points to Tsinghua but 10 Points to 清华? Assessing Large Language Models in Agentic Multilingual National Bias
Qianying Liu, Katrina Qiyao Wang, Fei Cheng et al.