Tiered Memory Management Beyond Hotness

Jinshu Liu; Hamid Hadian; Hanchen Xu; Huaicheng Li

2025 OSDI OSDI 2025

Tiered Memory Management Beyond Hotness

Abstract

Tiered memory systems often rely on access frequency (''hotness'') to guide data placement. However, hot data is not always performance-critical, limiting the effectiveness of hotness-based policies. We introduce amortized offcore latency (AOL), a novel metric that precisely captures the true performance impact of memory accesses by accounting for memory access latency and memory-level parallelism (MLP). Leveraging AOL, we present two powerful tiering mechanisms: SOAR, a profile-guided allocation policy that places objects based on their performance contribution, and ALTO, a lightweight page migration regulation policy to eliminate unnecessary migrations. SOAR and ALTO outperform four state-of-the-art tiering designs across a diverse set of workloads by up to 12.4×, while underperforming in a few cases by no more than 3%.

🌉 Interdisciplinary Bridge — Computer Science and Machine Learning

🧭 Keyword Pioneer — tiered memory management

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

Authors

Jinshu Liu , Hamid Hadian , Hanchen Xu , Huaicheng Li

Topics

Machine Learning > Application Areas > Efficient Computing Computer Science > Systems > Operating Systems

Keywords

performance optimization page migration tiered memory management memory latency memory-level parallelism

Download PDF

Related papers

OS Rendering Service Made Parallel with Out-of-Order Execution and In-Order Commit 2025

Deriving Semantic Checkers from Tests to Detect Silent Failures in Production Distributed Systems 2025

FineMem: Breaking the Allocation Overhead vs. Memory Waste Dilemma in Fine-Grained Disaggregated Memory Management 2025

Tigon: A Distributed Database for a CXL Pod 2025

Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload 2025