Papers
22,524 papers found
Multi-Agent Reinforcement Learning for Modeling, Simulating, and Optimizing Energy Markets
Matan Levy, Itay Segev, Alexander Tuisov et al.
Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling
Soumyaroop Nandi, Prem Natarajan
PlantTraitNet: An Uncertainty-Aware Multimodal Framework for Global-Scale Plant Trait Inference from Citizen Science Data
Ayushi Sharma, Johanna Trost, Daniel Lusk et al.
OIDA-QA: A Multimodal Benchmark for Analyzing the Opioid Industry Documents Archive
Xuan Shen, Brian Wingenroth, Zichao Wang et al.
Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances
Rishu Kumar Singh, Navneet Shreya, Sarmistha Das et al.
EgoEMS: A High-Fidelity Multimodal Egocentric Dataset for Cognitive Assistance in Emergency Medical Services
Keshara Weerasinghe, Xueren Ge, Tessa Heick et al.
Explainable Oracle Bone Script Recognition via Multimodal Pictographic Reasoning
Yin Wu, Zhengxuan Zhang, Jiayu Chen et al.
Complex Reasoning over Vision and Language --Leveraging Neurosymbolic AI
Parisa Kordjamshidi
10 Open Challenges Steering the Future of Vision-Language-Action Models
Soujanya Poria, Navonil Majumder, Chia-Yu Hung et al.
Towards Trustworthy Multimodal AI Systems
Chirag Agarwal
From Natural Language to Executable ETL Flows: The IBM DataStage Assistant
Nitin Gupta, Thomas Gschwind, Shramona Chakraborty et al.
Building Domain-Specific Small Language Models via Guided Data Generation
Aman Kumar, Ekant Muljibhai Amin, Xian Yeow Lee et al.
PeerCoPilot: A Language Model-Powered Assistant for Behavioral Health Organizations
Gao Mo, Naveen Janaki Raman, Megan Chai et al.
Automated Unified Reasoning with Vision-Language Models for Multi-modal Burn Assessment
Md Masudur Rahman, Mohamed El Masry, Gayle Gordillo et al.
GAICo: A Deployed and Extensible Framework for Evaluating Diverse and Multimodal Generative AI Outputs
Nitin Gupta, Pallav Koppisetti, Kausik Lakkaraju et al.
SlideBot: A Multi-Agent Framework for Generating Informative, Reliable, Multi-Modal Presentations
Eric Xie, Danielle Waterfield, Michael Kennedy et al.
Multimodal Tabular Data Learning
Jun-Peng Jiang
Multi-Robot Learning from Human Feedback
Connor Mattson
Fusing Time-Domain and Constellation Views: A Multimodal MAE for Wireless Signals (Student Abstract)
Agniva Banerjee, Arijit Sen
IMPACT: Integrated Multimodal Pipeline for Rapid Accident Causality Tracking (Student Abstract)
Vashu Chauhan, Avinash Anand, Manisha Luthra et al.
Multimodal Coarse-to-Local Transformer for End-to-End Autonomous Driving (Student Abstract)
Yeryeong Cho, Joongheon Kim