2024 NIPS NeurIPS 2024

Practical Shuffle Coding

Abstract

We present a general method for lossless compression of unordered data structures, including multisets and graphs. It is a variant of shuffle coding that is many orders of magnitude faster than the original and enables 'one-shot' compression of single unordered objects. Our method achieves state-of-the-art compression rates on various large-scale network graphs at speeds of megabytes per second, efficiently handling even a multi-gigabyte plain graph with one billion edges. We release an implementation that can be easily adapted to different data types and statistical models.

🧭 Keyword Pioneer — shuffle coding
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing
🌉 Interdisciplinary Bridge — Computer Science and Deep Learning and Machine Learning and Mathematics & Optimization