A Spatial RNN Codec for End-to-End Image Compression

Chaoyi Lin; Jiabao Yao; Fangdong Chen; Li Wang

CVPR 2020

Abstract

Recently, deep learning has been explored as a promising direction for image compression. Removing the spatial redundancy of an image is crucial for compression, and most learning-based methods focus on removing the redundancy between adjacent pixels. Intuitively, exploring a larger pixel range beyond adjacent pixels should help remove more of this redundancy. In this paper, we propose a fast yet effective method for end-to-end image compression that incorporates a novel spatial recurrent neural network. A block-based LSTM is used to remove redundant information between adjacent pixels and blocks. In addition, the proposed method is potentially efficient, since individual blocks can be processed in parallel. Experimental results demonstrate that the proposed model outperforms state-of-the-art traditional image compression standards and learning-based image compression models in terms of both PSNR and MS-SSIM. It achieves a 26.73% bit reduction relative to High Efficiency Video Coding (HEVC), the current official state-of-the-art video codec.
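
The abstract's point that individual blocks can be processed in parallel can be illustrated with a minimal sketch. This is not the paper's codec: the function names, the block size, and the left-neighbour residual used as a stand-in for the per-block model are all assumptions made for illustration. It only shows that once an image is partitioned into non-overlapping blocks, a per-block operation that reads nothing outside its block can be mapped over the blocks concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(image, block_size):
    """Split a 2-D list of pixels into non-overlapping square blocks, in row-major order."""
    height, width = len(image), len(image[0])
    if height % block_size or width % block_size:
        raise ValueError("image dimensions must be multiples of block_size")
    blocks = []
    for top in range(0, height, block_size):
        for left in range(0, width, block_size):
            rows = image[top:top + block_size]
            blocks.append([row[left:left + block_size] for row in rows])
    return blocks


def horizontal_residuals(block):
    """Hypothetical stand-in for a per-block model (NOT the paper's block LSTM).

    Keeps the first pixel of each row and replaces every other pixel with its
    difference from the left neighbour inside the same block.
    """
    return [
        [row[0]] + [row[i] - row[i - 1] for i in range(1, len(row))]
        for row in block
    ]


def process_blocks(image, block_size):
    """Apply the per-block function to every block; blocks do not depend on each other."""
    blocks = split_into_blocks(image, block_size)
    with ThreadPoolExecutor() as pool:
        # pool.map preserves the row-major block order of the input.
        return list(pool.map(horizontal_residuals, blocks))


if __name__ == "__main__":
    image = [
        [1, 2, 3, 4],
        [5, 6, 7, 8],
        [9, 10, 11, 12],
        [13, 14, 15, 16],
    ]
    for block in process_blocks(image, 2):
        print(block)
```

In a smooth image the residuals are small, which is the sense in which neighbouring pixels are redundant. A learned model such as the block-based LSTM described in the abstract would replace the toy residual function.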

Topics

Deep Learning > Architectures > Neural Networks
Deep Learning > Techniques > Model Architecture
Computer Vision > Processing > Image Processing
Computer Science > Applications > Computer Vision
Deep Learning > Optimization & Theory > Efficient Computing
Deep Learning > Architectures > Recurrent Neural Networks

Keywords

image compression, recurrent neural network, end-to-end learning, spatial redundancy, block LSTM
