Papers
KeepKV: Achieving Periodic Lossless KV Cache Compression for Efficient LLM Inference
Yuxuan Tian, Zihan Wang, Yebo Peng et al.
CharBench: Evaluating the Role of Tokenization in Character-Level Tasks
Omri Uzan, Yuval Pinter
Incoherence as Oracle-less Measure of Error in LLM-Based Code Generation
Thomas Jean-Michel Valentin, Ardi Madadi, Gaetano Sapia et al.
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
Chenglong Wang, Yifu Huo, Yang Gan et al.
A Rolling Stone Gathers No Moss: Adaptive Policy Optimization for Stable Self-Evaluation in Large Multimodal Models
Wenkai Wang, Hongcan Guo, Zheqi Lv et al.
REFO: Reinforced Evolutionary Faithfulness Optimization for Large Language Models
Yi Wang, Xiaqiang Tang, Keyu Hu et al.
OptScale: Probabilistic Optimality for Inference-time Scaling
Youkang Wang, Jian Wang, Rubing Chen et al.
Eliciting Chain-of-Thought in Base LLMs via Gradient-Based Representation Optimization
Zijian Wang, Yanxiang Ma, Chang Xu
GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Zihui Wu, Haichang Gao, Ping Wang et al.
DeepOR: A Deep Reasoning Foundation Model for Optimization Modeling
Ziyang Xiao, Yuan Jessica Wang, Xiongwei Han et al.
Test-time Prompt Intervention
Chenxu Yang, Qingyi Si, Mz Dai et al.
Language Model Distillation: A Temporal Difference Imitation Learning Perspective
Zishun Yu, Shangzhe Li, Xinhua Zhang
Expert-Inspired Multi-Agent Coordination for Multi-Objective Molecular Optimization
Daojian Zeng, Tianle Li, Jiahao Yang et al.
JELV: A Judge of Edit-Level Validity for Evaluation and Automated Reference Expansion in Grammatical Error Correction
Yuhao Zhan, Yuqing Zhang, Jing Yuan et al.
Multi-Metric Preference Alignment for Generative Speech Restoration
Junan Zhang, Xueyao Zhang, Jing Yang et al.
Beyond Step Pruning: Information Theory Based Step-level Optimization for Self-Refining Large Language Models
Jinman Zhao, Erxue Min, Hui Wu et al.
Learning from Guidelines: Structured Prompt Optimization for Expert Annotation Tasks
Wenliang Zhong, Haiqing Li, Thao M. Dang et al.
ResMAS: Resilience Optimization in LLM-based Multi-agent Systems
Zhilun Zhou, Zihan Liu, Jiahe Liu et al.
In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback
Mingye Zhu, Yi Liu, Zheren Fu et al.
Your Prompts Are Not Safe: Output-Free Membership Inference via Prompt Vectors in Vision-Language Tuning
Yuran Bian, Xiaohan Zhang, Zhiyuan Yu et al.
Reference Recommendation Based Membership Inference Attack Against Hybrid-Based Recommender Systems
Xiaoxiao Chi, Xuyun Zhang, Yan Wang et al.
Private Frequency Estimation via Residue Number Systems
Héber Hwang Arcolezi
Eguard: Defending LLM Embeddings Against Inversion Attacks via Text Mutual Information Optimization
Tiantian Liu, Hongwei Yao, Feng Lin et al.
Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs
Doniyorkhon Obidov, Honggang Yu, Xiaolong Guo et al.
MPMA: Preference Manipulation Attack Against Model Context Protocol
Zihan Wang, Rui Zhang, Yu Liu et al.