Papers

17,973 papers found
2025 EMNLP
2025 EMNLP
3LM: Bridging Arabic, STEM, and Code through Benchmarking
Basma El Amel Boussaha, Leen Al Qadi, Mugariya Farooq et al.
2025 EMNLP
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov, Amina Miftakhova, Artemiy Tereshchenko et al.
2025 EMNLP
2025 EMNLP
2025 EMNLP
2025 EMNLP
A Causal Lens for Evaluating Faithfulness Metrics
Kerem Zaman, Shashank Srivastava
2025 EMNLP
Accelerated Test-Time Scaling with Model-Free Speculative Sampling
Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi et al.
2025 EMNLP
Accelerating LLM Reasoning via Early Rejection with Partial Reward Modeling
Seyyed Saeid Cheshmi, Azal Ahmad Khan, Xinran Wang et al.
2025 EMNLP
AccessEval: Benchmarking Disability Bias in Large Language Models
Srikant Panda, Amit Agarwal, Hitesh Laxmichand Patel
2025 EMNLP
ACEBench: A Comprehensive Evaluation of LLM Tool Usage
Chen Chen, Xinlong Hao, Weiwen Liu et al.
2025 EMNLP
ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
Salma Kharrat, Fares Fourati, Marco Canini
2025 EMNLP
2025 EMNLP
A Comparison of Elementary Baselines for BabyLM
Rareș Păpușoi, Sergiu Nisioi
2025 EMNLP
2025 EMNLP
A Comprehensive Framework to Operationalize Social Stereotypes for Responsible AI Evaluations
Aida Mostafazadeh Davani, Sunipa Dev, Héctor Pérez-Urbina et al.
2025 EMNLP