Papers
9,944 papers found
What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles
Mengtao Zhou, Sifan Wu, Huan Zhang et al.
In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback
Mingye Zhu, Yi Liu, Zheren Fu et al.
Outlier Matters: Efficient Long-to-Short Reasoning via Outlier-Guided Model Merging
Qiyuan Zhu, Dezhi Li, Lujun Li et al.
ExtendAttack: Attacking Servers of LRMs via Extending Reasoning
Zhenhao Zhu, Yue Liu, Zhiwei Xu et al.
When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models
Wei Cai, Shujuan Liu, Jian Zhao et al.
MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine
Kaiyuan Ji, Yijin Guo, Zicheng Zhang et al.
SPAN: Benchmarking and Improving Cross-Calendar Temporal Reasoning of Large Language Models
Zhongjian Miao, Hao Fu, Chen Wei
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Haowei Wang, Rupeng Zhang, Junjie Wang et al.
MCPTox: A Benchmark for Tool Poisoning on Real-World MCP Servers
Zhiqiang Wang, Yichao Gao, Yanting Wang et al.
LexChain: Modeling Legal Reasoning Chains for Chinese Tort Case Analysis
Huiyuan Xie, Chenyang Li, Huining Zhu et al.
The Emotional Baby Is Truly Deadly: Does Your Multimodal Large Reasoning Model Have Emotional Flattery Towards Humans?
Yuan Xun, Xiaojun Jia, Xinwei Liu et al.
Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning
Chenyu Zhang, Lanjun Wang, Yiwen Ma et al.
Elite Pattern Reinforcement for Vehicle Routing Problems
Ning Li, Peng Lin, Peng Zhang et al.
Efficient Solution and Learning of Robust Factored MDPs
Yannik Schnitzer, Alessandro Abate, David Parker
HTN Plan Verification by Qualitative Temporal Reasoning
Tobias Schwartz, Diedrich Wolter
History-Aware Reasoning for GUI Agents
Ziwei Wang, Leyang Yang, Xiaoxuan Tang et al.
EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer
Pukun Zhao, Longxiang Wang, Miaowei Wang et al.
CoT-VLNBench: A Benchmark for Visual Chain-of-Thought Reasoning in Vision-Language-Navigation Robots
Xiao Zhao, Chang Liu, Ruiteng Ji et al.
CRAF: A Clinical Reasoning-Adaptive Framework via Reinforcement Learning for Similar Case Retrieval
Jie Lin, Lei Jiang, Zongyi Chen et al.
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs
Oluwanifemi Bamgbose, Masoud Hashemi, Sathwik Tejaswi Madhusudhan et al.
Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning
Zijun Chen, Wenbo Hu, Richang Hong
Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation
Dongyoon Hahm, Taywon Min, Woogyeol Jin et al.
Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment
Shigeki Kusaka, Keita Saito, Mikoto Kudo et al.
STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision
Chen Li, Han Zhang, Zhantao Yang et al.