Papers
16,557 papers found
SPA: Achieving Consensus in LLM Alignment via Self-Priority Optimization
Yue Huang, Xiangqi Wang, Xiangliang Zhang
Backdooring Rationalization
Lingxiao Kong, Jiahui Jiang, Wenchao Xu et al.
KVmix: Gradient-Based Layer Importance-Aware Mixed-Precision Quantization for KV Cache
Fei Li, Song Liu, Weiguo Wu et al.
AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization
Qiyang Li, Rui Kong, Yuchen Li et al.
DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis
Yinghao Aaron Li, Xilin Jiang, Fei Tao et al.
Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Sirui Liang, Pengfei Cao, Jian Zhao et al.
SparseRM: A Lightweight Preference Modeling with Sparse Autoencoder
Dengcan Liu, Jiahao Li, Zheren Fu et al.
Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction
Yijun Liu, Yixuan Wang, Yuzhuang Xu et al.
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
Yuhang Liu, Zeyu Liu, Shuanghe Zhu et al.
SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning
Lingkun Long, Rubing Yang, Yushi Huang et al.
URPO: A Unified Reward & Policy Optimization Framework for Large Language Models
Songshuo Lu, Hua Wang, Zhi Chen et al.
Better Datasets Start from RefineLab: Automatic Optimization for High-Quality Dataset Refinement
Xiaonan Luo, Yue Huang, Ping He et al.
QueryAligner: Customizing User Query to Match LLMs Preferences for Better Intent Recognition
Yunlong Ma, Bo Wang, Yihong Tang et al.
Inference-Aware Prompt Optimization for Aligning Black-Box Large Language Models
Saaduddin Mahmud, Mason Nakamura, Kyle Hollins Wray et al.
RefRea: Reference-Guided Reasoning with Meta-Cognition for Accurate Language Model Agents
Yuxiang Mai, Qiyue Yin, Wancheng Ni et al.
Confidence Estimation for Text-to-SQL in Large Language Models
Sepideh Entezari Maleki, Mohammadreza Pourreza, Davood Rafiei
TokenPowerBench: Benchmarking the Power Consumption of LLM Inference
Chenxu Niu, Wei Zhang, Jie Li et al.
SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling
Md Imbesat Hassan Rizvi, Xiaodan Zhu, Iryna Gurevych
Optimization and Robustness-Informed Membership Inference Attacks for LLMs
Zichen Song, Qixin Zhang, Ming Li et al.
ECD: Evidence-guided Contrastive Decoding in Retrieval-Augmented Generation with Accurate Knowledge Reference Adjustment
Yize Sui, Yan Xu, Kun Hu et al.
CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization
Weiwei Sun, Shengyu Feng, Shanda Li et al.
Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning
Yiliu Sun, Zicheng Zhao, Yang Wei et al.
Improving the Accuracy of Dense Retrieval on the Quantized Indexes via Gradient Optimization of the Target Embeddings
Cong Tan, Yongqi Shao, Hong Huo et al.
Put the Space of LoRA Initialization to the Extreme to Preserve Pre-trained Knowledge
Pengwei Tang, Xiaolin Hu, Yong Liu et al.
Rectify Evaluation Preference: Improving LLMs’ Critique on Math Reasoning via Perplexity-aware Reinforcement Learning
Changyuan Tian, Zhicong Lu, Shuang Qian et al.