Papers
19,135 papers found
Time-Frequency Token Advantage Clipping for Training Efficient Large Reasoning Model
Rong Bao, Bo Wang, Xiao Wang et al.
JudgeBoard: Benchmarking and Enhancing Small Language Models for Reasoning Evaluation
Zhenyu Bi, Gaurav Srivastava, Yang Li et al.
Sampling-Free Uncertainty Quantification via Hidden State Dynamics in Language Models
Yixin Bu, Guanyun Zou, Renzhi Wang et al.
TIV: Thought Injection via Vectors for Efficient Reasoning in Large Reasoning Models
Yi Cao, Weijie Shi, Wei-Jie Xu et al.
Skill Path: Unveiling Language Skills from Circuit Graphs
Hang Chen, Xinyu Yang, Jiaying Zhu et al.
HLPD: Aligning LLMs to Human Language Preference for Machine-Revised Text Detection
Fangqi Dai, Xingjian Jiang, Zizhuang Deng
AdaSpec: Adaptive Multilingual Speculative Decoding with Self-Synthesized Language-Aware Training and Vocabulary Simplification
Dinh-Truong Do, Nguyen-Khang Le, Le-Minh Nguyen
MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention Across Vision-Language Models
Xiaoran Fan, Zhichao Sun, Tao Ji et al.
Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting
Giacomo Frisoni, Lorenzo Molfetta, Mattia Buzzoni et al.
Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Yu Fu, Haz Sameen Shahgir, Hui Liu et al.
MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools
Zikang Guo, Benfeng Xu, Chiwei Zhu et al.
End-to-End Contrastive Language-Speech Pretraining Model for Long-Form Spoken Question Answering
Jiliang Hu, Zuchao Li, Baoyuan Qi et al.
DUP: Detection-guided Unlearning for Backdoor Purification in Language Models
Man Hu, Yahui Ding, Yatao Yang et al.
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization
Chengyu Huang, Zhengxin Zhang, Claire Cardie
QiMeng-CRUX: Narrowing the Gap Between Natural Language and Verilog via Core Refined Understanding eXpression
Lei Huang, Rui Zhang, Jiaming Guo et al.
Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism
Jinhong Jeong, Sunghyun Lee, Jaeyoung Lee et al.
LC3: Long Cross-Language Code Clone Detection Enhanced by Opcode Sequences and Affinity Aggregation
Xilin Lan, Huan Zhang, Yang Yang et al.
Language Drift in Multilingual Retrieval-Augmented Generation: Characterization and Decoding-Time Mitigation
Bo Li, Zhenghua Xu, Rui Xie
From Sampling to Cognition: Modeling Internal Cognitive Confidence in Language Models for Robust Uncertainty Calibration
Hao Li, Tao He, Jiafeng Liang et al.
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning
Haoyu Li, Xuhong Li, Yiming Dong et al.
TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model
Yixing Li, Ruobing Xie, Zhen Yang et al.
DRIFT: Difference-Aware Reinforcement Through Iterative Fine-Tuning for Language Model
Wenjie Liao, Xiaohui Song, Haonan Lu
Is Your (Reasoning) Multimodal Language Model Vulnerable Toward Distractions?
Ming Liu, Hao Chen, Jindong Wang et al.
SafeNLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces
Ruiheng Liu, Xiaobing Chen, Jinyu Zhang et al.
RefRea: Reference-Guided Reasoning with Meta-Cognition for Accurate Language Model Agents
Yuxiang Mai, Qiyue Yin, Wancheng Ni et al.