Papers
26,909 papers found
Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model
Kosuke Takahashi, Katsuhito Sudoh, Satoshi Nakamura
Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning
Angeliki Lazaridou, Anna Potapenko, Olivier Tieleman
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation
Pablo Messina, Rene Vidal, Denis Parra et al.
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Siyuan Wang, Dianyi Wang, Chengxing Zhou et al.
UMUTeam at SemEval-2025 Task 1: Leveraging Multimodal and Large Language Model for Identifying and Ranking Idiomatic Expressions
Ronghao Pan, Tomás Bernal - Beltrán, José Antonio García - Díaz et al.
Balancing User-Item Structure and Interaction with Large Language Models and Optimal Transport for Multimedia Recommendation
Haodong Li, Lianyong Qi, Weiming Liu et al.
Spatial and Temporal Language Understanding: Representation, Reasoning, and Grounding
Parisa Kordjamshidi, Qiang Ning, James Pustejovsky et al.
UMUTeam at SemEval-2025 Task 1: Leveraging Multimodal and Large Language Model for Identifying and Ranking Idiomatic Expressions
Ronghao Pan, Tomás Bernal - Beltrán, José Antonio García - Díaz et al.
Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
Meng Li, Michael Vrazitulis, David Schlangen
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks
Fawaz Sammani, Tanmoy Mukherjee, Nikos Deligiannis
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Yu Zhang, Ron J. Weiss, Heiga Zen et al.
ViGLUE: A Vietnamese General Language Understanding Benchmark and Analysis of Vietnamese Language Models
Minh-Nam Tran, Phu-Vinh Nguyen, Long Nguyen et al.
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning
Wasu Top Piriyakulkij, Cassidy Langenfeld, Tuan Anh Le et al.
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou, Yicong Hong, Qi Wu
Align and Prompt: Video-and-Language Pre-Training With Entity Prompts
Dongxu Li, Junnan Li, Hongdong Li et al.
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding
Morris Alper, Michael Fiman, Hadar Averbuch-Elor
A New Path: Scaling Vision-and-Language Navigation With Synthetic Instructions and Imitation Learning
Aishwarya Kamath, Peter Anderson, Su Wang et al.
Towards Surveillance Video-and-Language Understanding: New Dataset Baselines and Challenges
Tongtong Yuan, Xuange Zhang, Kun Liu et al.
Context-Aware Integration of Language and Visual References for Natural Language Tracking
Yanyan Shao, Shuting He, Qi Ye et al.
Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language Models
Taichi Iki, Akiko Aizawa
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
Haoxuan You, Rui Sun, Zhecan Wang et al.
Experiences with Shared Resources for Research and Education in Speech and Language Processing
Rebecca Bates, Eric Fosler-Lussier, Florian Metze et al.
Learning to Detect Bipolar Disorder and Borderline Personality Disorder with Language and Speech in Non-Clinical Interviews
Bo Wang, Yue Wu, Niall Taylor et al.