AdaNIC: Towards Practical Neural Image Compression via Dynamic Transform Routing

Lvfang Tao; Wei Gao; Ge Li; Chenhao Zhang

2023 ICCV ICCV 2023

AdaNIC: Towards Practical Neural Image Compression via Dynamic Transform Routing

Abstract

Compressive autoencoders (CAEs) play an important role in deep learning-based image compression, but large-scale CAEs are computationally expensive. We propose a framework with three techniques to enable efficient CAE-based image coding: 1) Spatially-adaptive convolution and normalization operators enable block-wise nonlinear transform to spend FLOPs unevenly across the image to be compressed, according to a transform capacity map. 2) Just-unpenalized model capacity (JUMC) optimizes the transform capacity of each CAE block via rate-distortion-complexity optimization, finding the optimal capacity for the source image content. 3) A lightweight routing agent model predicts the transform capacity map for the CAEs by approximating JUMC targets. By activating the best-sized sub-CAE inside the slimmable supernet, our approach achieves up to 40% computational speed-up with minimal BD-Rate increase, validating its ability to save computational resources in a content-aware manner.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — transform routing

🐣 Hot Topic Early Bird — rate-distortion optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing

Authors

Lvfang Tao , Wei Gao , Ge Li , Chenhao Zhang

Topics

Machine Learning > Application Areas > Efficient Computing Deep Learning > Architectures > Autoencoders Computer Vision > Processing > Image Processing Deep Learning > Learning Types > Deep Learning

Keywords

rate-distortion optimization neural image compression compressive autoencoder transform routing slimmable supernet dynamic transform routing spatially-adaptive convolution

Download PDF

Related papers

PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework 2023

Periodically Exchange Teacher-Student for Source-Free Object Detection 2023

Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations 2023

Minimal Solutions to Uncalibrated Two-view Geometry with Known Epipoles 2023

3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation 2023