Efficient Scheduling Policies for Microsecond-Scale Tasks

Sarah McClure; Amy Ousterhout; Scott Shenker; Sylvia Ratnasamy

2022 NSDI NSDI 2022

Efficient Scheduling Policies for Microsecond-Scale Tasks

Abstract

Datacenter operators today strive to support microsecond-latency applications while also using their limited CPU resources as efficiently as possible. To achieve this, several recent systems allow multiple applications to run on the same server, granting each a dedicated set of cores and reallocating cores across applications over time as load varies. Unfortunately, many of these systems do a poor job of navigating the tradeoff between latency and efficiency, sacrificing one or both, especially when handling tasks as short as 1μs. While the implementations of these systems (threading libraries, network stacks, etc.) have been heavily optimized, the policy choices that they make have received less scrutiny. Most systems implement a single choice of policy for allocating cores across applications and for load-balancing tasks across cores within an application. In this paper, we use simulations to compare these different policy options and explore which yield the best combination of latency and efficiency. We conclude that work stealing performs best among load-balancing policies, multiple policies can perform well for core allocations, and, surprisingly, static core allocations often outperform reallocation with small tasks. We implement the best-performing policy choices by building on Caladan, an existing core-allocating system, and demonstrate that they can yield efficiency improvements of up to 13-22% without degrading (median or tail) latency.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — work stealing

🐝 Cross-Pollinator — Artificial Intelligence, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Sarah McClure , Amy Ousterhout , Scott Shenker , Sylvia Ratnasamy

Topics

Machine Learning > Application Areas > Efficient Computing Mathematics & Optimization > Optimization > Online Algorithms Computer Science > Systems > Distributed Systems Computer Science > Systems > Operating Systems

Keywords

task scheduling load balancing latency optimization scheduling policy work stealing core allocation datacenter efficiency

Download PDF

Related papers

Starlight: Fast Container Provisioning on the Edge and over the WAN 2022

PowerTCP: Pushing the Performance Limits of Datacenter Networks 2022

Gearbox: A Hierarchical Packet Scheduler for Approximate Weighted Fair Queuing 2022

Characterizing Physical-Layer Transmission Errors in Cable Broadband Networks 2022

Orca: Server-assisted Multicast for Datacenter Networks 2022