Fast Spatio-Temporal Residual Network for Video Super-Resolution

Sheng Li; Fengxiang He; Bo Du; Lefei Zhang; Yonghao Xu; Dacheng Tao

CVPR 2019

Abstract

Recently, deep-learning-based video super-resolution (SR) methods have achieved promising performance. To simultaneously exploit the spatial and temporal information of videos, employing 3-dimensional (3D) convolutions is a natural approach. However, directly using 3D convolutions may lead to excessively high computational complexity, which restricts the depth of video SR models and thus undermines performance. In this paper, we present a novel fast spatio-temporal residual network (FSTRN) that adopts 3D convolutions for the video SR task in order to enhance performance while maintaining a low computational load. Specifically, we propose a fast spatio-temporal residual block (FRB) that factorizes each 3D filter into the product of two 3D filters of considerably lower dimensions. Furthermore, we design a cross-space residual learning scheme that directly links the low-resolution space and the high-resolution space, which can greatly relieve the computational burden on the feature fusion and up-scaling parts. Extensive evaluations and comparisons on benchmark datasets validate the strengths of the proposed approach and demonstrate that the proposed network significantly outperforms the current state-of-the-art methods.
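The saving from factorizing a 3D filter can be illustrated with a simple weight count. The sketch below is not taken from the paper's code: it assumes a k x k x k filter is replaced by a 1 x k x k spatial filter followed by a k x 1 x 1 temporal filter, and the channel width of 64 and kernel size of 3 are illustrative values only. The function names are hypothetical.

```python
def conv3d_weights(c_in, c_out, k_t, k_h, k_w):
    """Number of weights (biases ignored) in a 3D convolution layer."""
    return c_in * c_out * k_t * k_h * k_w


def full_3d_weights(channels, k):
    """One k x k x k convolution with equal input and output channels."""
    return conv3d_weights(channels, channels, k, k, k)


def factorized_3d_weights(channels, k):
    """Assumed factorization: a 1 x k x k spatial convolution
    followed by a k x 1 x 1 temporal convolution."""
    spatial = conv3d_weights(channels, channels, 1, k, k)
    temporal = conv3d_weights(channels, channels, k, 1, 1)
    return spatial + temporal


if __name__ == "__main__":
    channels, k = 64, 3  # illustrative values, not from the source
    full = full_3d_weights(channels, k)
    fact = factorized_3d_weights(channels, k)
    print(f"full 3D filter weights:   {full}")
    print(f"factorized weights:       {fact}")
    print(f"reduction factor:         {full / fact:.2f}")
```

Under these assumptions the per-layer weight count scales with k^2 + k instead of k^3, which is the kind of reduction that would let a network go deeper at the same cost.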

Topics

Machine Learning > Optimization & Theory > Optimization
Deep Learning > Architectures > Neural Networks
Computer Vision > Processing > Video Processing
Deep Learning > Learning Types > Representation Learning
Deep Learning > Architectures > Convolutional Neural Networks

Keywords

image restoration; video super-resolution; computational complexity; video processing; 3D convolution; residual learning; spatio-temporal residual network; cross-space residual learning
