VADER: Video Alignment Differencing and Retrieval

Alexander Black; Simon Jenni; Tu Bui; Md. Mehrab Tanjim; Stefano Petrangeli; Ritwik Sinha; Viswanathan Swaminathan; John Collomosse

ICCV 2023

Abstract

We propose VADER, a spatio-temporal matching, alignment, and change summarization method to help fight misinformation spread via manipulated videos. VADER matches and coarsely aligns partial video fragments to candidate videos using a robust visual descriptor and scalable search over adaptively chunked video content. A transformer-based alignment module then refines the temporal localization of the query fragment within the matched video. A space-time comparator module identifies regions of manipulation between aligned content, and is invariant to residual temporal misalignments and to artifacts arising from non-editorial changes to the content. Robustly matching a video to a trusted source allows conclusions to be drawn about its provenance, supporting informed trust decisions on content encountered. Code and data are available at https://github.com/AlexBlck/vader
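
The coarse matching stage described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes per-frame descriptors are already available as plain float vectors, and it substitutes fixed-length chunks for the adaptive chunking, mean pooling for the learned descriptor aggregation, and a brute-force cosine search for the scalable index. The names `mean_pool`, `chunk_video`, and `coarse_match` are hypothetical and chosen only for illustration.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

Vector = List[float]


def mean_pool(frames: Sequence[Sequence[float]]) -> Vector:
    """Average per-frame descriptors into a single chunk descriptor."""
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def chunk_video(
    frames: Sequence[Sequence[float]], chunk_len: int
) -> List[Tuple[int, Vector]]:
    """Split a video into fixed-length chunks (an assumption; the paper
    chunks adaptively) and return (start_frame, chunk_descriptor) pairs."""
    return [
        (start, mean_pool(frames[start:start + chunk_len]))
        for start in range(0, len(frames), chunk_len)
    ]


def coarse_match(
    query_frames: Sequence[Sequence[float]],
    index: Dict[str, Sequence[Sequence[float]]],
    chunk_len: int,
) -> Optional[Tuple[str, int, float]]:
    """Brute-force search for the chunk most similar to the query fragment.

    Returns (video_id, start_frame, score), or None if the index is empty.
    The start frame is only a coarse temporal estimate that a later
    alignment step would refine.
    """
    query_desc = mean_pool(query_frames)
    best: Optional[Tuple[str, int, float]] = None
    for video_id, frames in index.items():
        for start, chunk_desc in chunk_video(frames, chunk_len):
            score = cosine(query_desc, chunk_desc)
            if best is None or score > best[2]:
                best = (video_id, start, score)
    return best
```

Calling `coarse_match(query_frames, index, chunk_len)` with a dictionary that maps video identifiers to lists of frame descriptors returns the best-matching video and the start frame of its closest chunk. In the pipeline the abstract describes, that coarse position would then be refined by the transformer-based alignment module before the space-time comparator is applied.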

Topics

Deep Learning > Architectures > Transformers
Computer Vision > Processing > Video Processing
Computer Vision > Processing > Video Understanding

Keywords

video retrieval, misinformation detection, video alignment, video manipulation detection, spatio-temporal matching, transformer-based alignment
