Papers
Beyond Step Pruning: Information Theory Based Step-level Optimization for Self-Refining Large Language Models
Jinman Zhao, Erxue Min, Hui Wu et al.
SABER: Switchable and Balanced Training for Efficient LLM Reasoning
Kai Zhao, Yanjun Zhao, Jiaming Song et al.
M3UCD: A Multi-task Multimodal Metaphor Understanding Challenge Dataset for LLMs
Tianlong Zheng, Yating Yang, Rui Dong et al.
STaR: Sensitive Trajectory Regulation for Unlearning in Large Reasoning Models
Jingjing Zhou, Gaoxiang Cong, Li Su et al.
When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models
Wei Cai, Shujuan Liu, Jian Zhao et al.
ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs
Xunlei Chen, Jinyu Guo, Yuang Li et al.
Efficient, Secure, Differentially Private Deep Learning in the Two-Server Model
Jun Feng, Hong Sun, Pengfei Zhang et al.
MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine
Kaiyuan Ji, Yijin Guo, Zicheng Zhang et al.
Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models
Kunhao Li, Wenhao Li, Di Wu et al.
Generic Adversarial Attack Framework Against Graph-based Vertical Federated Learning
Yimin Liu, Peng Jiang, Qi Liu et al.
Learning Vision-Based Neural Network Controllers with Semi-Probabilistic Safety Guarantees
Xinhang Ma, Junlin Wu, Hussein Sibai et al.
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Haowei Wang, Rupeng Zhang, Junjie Wang et al.
GUIC: Certified Graph Unlearning with Individual Fairness Guarantees
Zichong Wang, Tongliang Liu, Wenbin Zhang
Robust Learning from Noisily Labeled Long-Tailed Data via Fairness Regularizer
Jiaheng Wei, Zhaowei Zhu, Gang Niu et al.
LexChain: Modeling Legal Reasoning Chains for Chinese Tort Case Analysis
Huiyuan Xie, Chenyang Li, Huining Zhu et al.
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception
Yuankun Xie, Ruibo Fu, Xiaopeng Wang et al.
iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification
Zixun Xiong, Gaoyi Wu, Qingyang Yu et al.
HalluClean: A Unified Framework to Combat Hallucinations in LLMs
Yaxin Zhao, Yu Zhang
Extreme Value Monte Carlo Tree Search for Classical Planning
Masataro Asai, Stephen Wissow
Universal Safety Controllers with Learned Prophecies
Bernd Finkbeiner, Niklas Metzger, Satya Prakash Nayak et al.
Symmetry-Aware Transformer Training for Automated Planning
Markus Fritzsche, Elliot Gestrin, Jendrik Seipp
Two Constraint Compilation Methods for Lifted Planning
Periklis Mantenoglou, Luigi Bonassi, Enrico Scala et al.
Efficient Solution and Learning of Robust Factored MDPs
Yannik Schnitzer, Alessandro Abate, David Parker