LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition

Zuxuan Wu; Caiming Xiong; Yu-Gang Jiang; Larry S. Davis

NeurIPS 2019

Abstract

This paper presents LiteEval, a simple yet effective coarse-to-fine framework for resource-efficient video recognition, suitable for both online and offline scenarios. LiteEval first derives decent yet computationally cheap features at a coarse scale with a lightweight CNN model, then decides on the fly whether to compute more powerful features for each incoming video frame at a finer scale to obtain more detail. This is achieved by a coarse LSTM and a fine LSTM operating cooperatively, together with a conditional gating module that learns when to allocate more computation. Extensive experiments on two large-scale video benchmarks, FCVID and ActivityNet, demonstrate that LiteEval requires substantially less computation while offering excellent classification accuracy for both online and offline predictions.
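The control flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the CNN features, the two LSTMs, and the learned gate are replaced by hypothetical scalar stand-ins (`coarse_feature`, `fine_feature`, a fixed `gate_threshold`, and made-up per-frame costs), chosen only to show how a gate driven by cheap coarse information might decide when the expensive fine path runs.

```python
from dataclasses import dataclass, field
from typing import List, Sequence


@dataclass
class GatedResult:
    coarse_state: float = 0.0   # stand-in for the coarse LSTM state
    fine_state: float = 0.0     # stand-in for the fine LSTM state
    fine_steps: List[int] = field(default_factory=list)  # frames sent to the fine path
    cost: float = 0.0           # accumulated (hypothetical) compute cost


def coarse_feature(frame: Sequence[float]) -> float:
    """Cheap stand-in for lightweight-CNN features: the frame mean."""
    return sum(frame) / len(frame)


def fine_feature(frame: Sequence[float]) -> float:
    """Expensive stand-in for fine-scale features: the frame value range."""
    return max(frame) - min(frame)


def run_coarse_to_fine(
    frames: Sequence[Sequence[float]],
    gate_threshold: float = 0.5,  # assumption: fixed rule instead of a learned gate
    coarse_cost: float = 1.0,     # assumption: illustrative cost units
    fine_cost: float = 10.0,
) -> GatedResult:
    result = GatedResult()
    for t, frame in enumerate(frames):
        # The coarse path runs on every frame.
        c = coarse_feature(frame)
        result.cost += coarse_cost

        # Toy gate: request fine features only when the coarse feature
        # departs from what the coarse state has already summarized.
        novelty = abs(c - result.coarse_state)
        result.coarse_state = 0.5 * result.coarse_state + 0.5 * c

        if novelty > gate_threshold:
            # The fine path runs only when the gate fires.
            f = fine_feature(frame)
            result.cost += fine_cost
            result.fine_state = 0.5 * result.fine_state + 0.5 * f
            result.fine_steps.append(t)
    return result


# Two static frames followed by two frames with new content.
demo_frames = [[0.0, 0.0], [0.0, 0.0], [1.0, 3.0], [1.0, 3.0]]
demo = run_coarse_to_fine(demo_frames)
print(demo.fine_steps, demo.cost)
```

In this sketch the saving comes from skipping the fine path on frames that the coarse path already explains; in the paper the gating decision is learned rather than thresholded.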

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Optimization
Machine Learning > Application Areas > Efficient Computing
Computer Vision > Analysis > Video Understanding
Deep Learning > Optimization & Theory > Efficient Computing
Deep Learning > Architectures > Convolutional Neural Networks

Keywords

motion analysis, video recognition, video classification, efficient computing, resource allocation, convolutional neural network, dynamic computation, lightweight CNN, resource efficient
