Event Retrieval in Large Video Collections with Circulant Temporal Encoding

Jerome Revaud; Matthijs Douze; Cordelia Schmid; Herve Jegou

2013 CVPR CVPR 2013

Event Retrieval in Large Video Collections with Circulant Temporal Encoding

Abstract

This paper presents an approach for large-scale event retrieval. Given a video clip of a specific event, e.g., the wedding of Prince William and Kate Middleton, the goal is to retrieve other videos representing the same event from a dataset of over 100k videos. Our approach encodes the frame descriptors of a video to jointly represent their appearance and temporal order. It exploits the properties of circulant matrices to compare the videos in the frequency domain. This offers a significant gain in complexity and accurately localizes the matching parts of videos. Furthermore, we extend product quantization to complex vectors in order to compress our descriptors, and to compare them in the compressed domain. Our method outperforms the state of the art both in search quality and query time on two large-scale video benchmarks for copy detection, T RECVID and CC WEB . Finally, we introduce a challenging dataset for event retrieval, EVVE, and report the performance on this dataset.

🚀 Conference Pioneer — CVPR 2013

🌉 Interdisciplinary Bridge — Computer Science and Computer Vision

📈 Trend Setter — Video Understanding

🧭 Keyword Pioneer — event retrieval

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Natural Language Processing