2025
ICML
ICML 2025
CateKV: On Sequential Consistency for Long-Context LLM Inference Acceleration
Authors
Haoyun Jiang
,
Haolin Li
,
Jianwei Zhang
,
Fei Huang
,
Qiang Hu
,
Minmin Sun
,
Shuai Xiao
,
Yong Li
,
Junyang Lin
,
Jiangchao Yao