Papers
Accelerating Design Space Exploration for LLM Training Systems with Multi-experiment Parallel Simulation
Fei Gui, Kaihui Gao, Li Chen et al.
Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs
Qing Wang, Jiwu Shu, Jing Wang et al.
A Layered Formal Methods Approach to Answering Queue-related Queries
Divya Raghunathan, Maria Apostolaki, Aarti Gupta
AsTree: An Audio Subscription Architecture Enabling Massive-Scale Multi-Party Conferencing
Tong Meng, Wenfeng Li, Chao Yuan et al.
AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training
Guanbin Xu, Zhihao Le, Yinhe Chen et al.
Beehive: A Scalable Disaggregated Memory Runtime Exploiting Asynchrony of Multithreaded Programs
Quanxi Li, Hong Huang, Ying Liu et al.
BFTBrain: Adaptive BFT Consensus with Reinforcement Learning
Chenyuan Wu, Haoyun Qin, Mohammad Javad Amiri et al.
Building an Elastic Block Storage over EBOFs Using Shadow Views
Sheng Jiang, Ming Liu
Building Massive MIMO Baseband Processing on a Single-Node Supercomputer
Xincheng Xie, Wentao Hou, Zerui Guo et al.
ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development
Borui Wan, Mingji Han, Yiyao Sheng et al.
CATO: End-to-End Optimization of ML-Based Traffic Analysis Pipelines
Gerry Wan, Shinan Liu, Francesco Bronzino et al.
CEGS: Configuration Example Generalizing Synthesizer
Jianmin Liu, Li Chen, Dan Li et al.
CellReplay: Towards accurate record-and-replay for cellular networks
William Sentosa, Balakrishnan Chandrasekaran, P. Brighten Godfrey et al.
ClubHeap: A High-Speed and Scalable Priority Queue for Programmable Packet Scheduling
Zhikang Chen, Haoyu Song, Zhiyu Zhang et al.
DISC: Backpressure Mitigation In Multi-tier Applications With Distributed Shared Connection
Brice Ekane, Djob Mvondo, Renaud Lachaize et al.
Dissecting and Streamlining the Interactive Loop of Mobile Cloud Gaming
Yang Li, Jiaxing Qiu, Hongyi Wang et al.
Eden: Developer-Friendly Application-Integrated Far Memory
Anil Yelam, Stewart Grant, Saarth Deshpande et al.
Efficient Direct-Connect Topologies for Collective Communications
Liangyu Zhao, Siddharth Pal, Tapan Chugh et al.
Efficient Multi-WAN Transport for 5G with OTTER
Mary Hogan, Gerry Wan, Yiming Qiu et al.
Enabling Portable and High-Performance SmartNIC Programs with Alkali
Jiaxin Lin, Zhiyuan Guo, Mihir Shah et al.
Enabling Silent Telemetry Data Transmission with InvisiFlow
Yinda Zhang, Liangcheng Yu, Gianni Antichi et al.
Enhancing Network Failure Mitigation with Performance-Aware Ranking
Pooria Namyar, Arvin Ghavidel, Daniel Crankshaw et al.
eTran: Extensible Kernel Transport with eBPF
Zhongjie Chen, Qingkai Meng, ChonLam Lao et al.
Everything Matters in Programmable Packet Scheduling
Albert Gran Alcoz, Balázs Vass, Pooria Namyar et al.
Evolution of Aegis: Fault Diagnosis for AI Model Training Service in Production
Jianbo Dong, Kun Qian, Pengcheng Zhang et al.