Papers
16,557 papers found
Parameter-Free Clustering via Self-Supervised Consensus Maximization
Lijun Zhang, Suyuan Liu, Siwei Wang et al.
CauVQ: Causal Vector Quantization for Graph OOD Generalization
Weihong Zhang, Liang Bai, Hangyuan Du et al.
Length-Adaptive Interest Network for Balancing Long and Short Sequence Modeling in CTR Prediction
Zhicheng Zhang, Zhaocheng Du, Jieming Zhu et al.
Minimum-Length Conformal Prediction Sets for Ordinal Classification
Zijian Zhang, Xinyu Chen, Yuanjie Shi et al.
Differentiable Sparse Identification of Lagrangian Dynamics
Zitong Zhang, Hao Sun
TRACE: A Generalizable Drift Detector for Streaming Data-Driven Optimization
Yuan-Ting Zhong, Ting Huang, Xiaolin Xiao et al.
UniAPO: Unified Multimodal Automated Prompt Optimization
Qipeng Zhu, Yanzhe Chen, Huasong Zhong et al.
PSPO: Prompt-Level Prioritization and Experience-Weighted Smoothing for Efficient Policy Optimization
Xinxin Zhu, Ying He, Haowen Hou et al.
GDBA Revisited: Unleashing the Power of Guided Local Search for Distributed Constraint Optimization
Yanchen Deng, Xinrun Wang, Bo An
Safe Multi-Agent Reinforcement Learning via Distributional Safety Critic and Maximum Entropy Optimization
Qiwei Liu, Ye Yuan, Lingyue Zhang et al.
HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
Zejiao Liu, Junqi Tu, Yitian Hong et al.
MARPO: A Reflective Policy Optimization for Multi-Agent Reinforcement Learning
Cuiling Wu, Yaozhong Gan, Junliang Xing et al.
HiveMind: Contribution-Guided Online Prompt Optimization of LLM Multi-Agent Systems
Yihan Xia, Taotao Wang, Shengli Zhang et al.
Enhancing PIBT via Multi-Action Operations
Egor Yukhnevich, Anton Andreychuk
Where Norms and References Collide: Evaluating LLMs on Normative Reasoning
Mitchell Abrams, Kaveh Eskandari Miandoab, Felix Gervits et al.
IROTE: Human-like Traits Elicitation of Large Language Model via In-Context Self-Reflective Optimization
Yuzhuo Bai, Shitong Duan, Muhua Huang et al.
OptiHive: Ensemble Selection for LLM-Based Optimization via Statistical Modeling
Maxime Bouscary, Saurabh Amin
DCTR: Dual-Constraint Subgraph Optimization for Knowledge Graph-based Retrieval-Augmented Generation
Yukun Cao, Zirui Xu, Dongyang Li et al.
Improving Long-Context Summarization with Multi-Granularity Retrieval Optimization
Xueyu Chen, Kaitao Song, Zifan Song et al.
From Mathematical Reasoning to Code: Generalization of Process Reward Models in Test-Time Scaling
Zhengyu Chen, Yudong Wang, Teng Xiao et al.
HLPD: Aligning LLMs to Human Language Preference for Machine-Revised Text Detection
Fangqi Dai, Xingjian Jiang, Zizhuang Deng
TimeBill: Time-Budgeted Inference for Large Language Models
Qi Fan, An Zou, Yehan Ma
Group Causal Policy Optimization for Post-Training Large Language Models
Ziyin Gu, Jingyao Wang, Ran Zuo et al.
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization
Chengyu Huang, Zhengxin Zhang, Claire Cardie