ICML 2024

HexGen: Generative Inference of Large Language Model over Heterogeneous Environment