Towards a Rigorous Evaluation of Time-Series Anomaly Detection

Siwon Kim; Kukjin Choi; Hyun-Soo Choi; Byunghan Lee; Sungroh Yoon

AAAI 2022

Abstract

In recent years, studies on time-series anomaly detection (TAD) have reported high F1 scores on benchmark TAD datasets, giving the impression of clear improvements in TAD. However, most studies apply a peculiar evaluation protocol called point adjustment (PA) before scoring. In this paper, we theoretically and experimentally reveal that the PA protocol is highly likely to overestimate detection performance; even a random anomaly score can easily turn into a state-of-the-art TAD method. Therefore, comparing TAD methods after applying the PA protocol can lead to misguided rankings. Furthermore, we question the potential of existing TAD methods by showing that an untrained model obtains detection performance comparable to that of existing methods even when PA is forbidden. Based on our findings, we propose a new baseline and an evaluation protocol. We expect that our study will support rigorous evaluation of TAD and lead to further improvement in future research.
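The abstract does not spell out the PA protocol, so the following is only a minimal sketch under a stated assumption: PA is taken to be the commonly described rule in which every point of a contiguous ground-truth anomaly segment is counted as detected once any single point in that segment is flagged. The helper names `point_adjust` and `f1_score`, the toy labels, and the 0.95 threshold are illustrative choices and are not taken from the paper.

```python
import random


def point_adjust(pred, label):
    """Return a copy of pred in which every ground-truth anomaly segment
    that contains at least one predicted anomaly is marked fully detected.
    (Assumed PA rule; the abstract itself does not define it.)"""
    adjusted = list(pred)
    n = len(label)
    i = 0
    while i < n:
        if label[i] == 1:
            # Find the end (exclusive) of this contiguous anomaly segment.
            j = i
            while j < n and label[j] == 1:
                j += 1
            # One hit anywhere in the segment marks the whole segment.
            if any(pred[i:j]):
                for k in range(i, j):
                    adjusted[k] = 1
            i = j
        else:
            i += 1
    return adjusted


def f1_score(pred, label):
    """Point-wise F1 for binary predictions and labels."""
    tp = sum(1 for p, y in zip(pred, label) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(pred, label) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(pred, label) if p == 0 and y == 1)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy series (hypothetical): 1000 points with two long anomaly segments.
label = [0] * 1000
for t in list(range(200, 300)) + list(range(600, 800)):
    label[t] = 1

# Random anomaly scores, thresholded at an arbitrary value.
rng = random.Random(0)
scores = [rng.random() for _ in label]
pred = [1 if s > 0.95 else 0 for s in scores]

f1_raw = f1_score(pred, label)
f1_pa = f1_score(point_adjust(pred, label), label)
print(f"F1 without PA: {f1_raw:.3f}")
print(f"F1 with PA:    {f1_pa:.3f}")
```

Under this rule, PA can only convert missed anomaly points into hits, so the adjusted F1 is never lower than the raw F1. With long anomaly segments, a random score is very likely to hit each segment at least once, which is the kind of inflation the abstract warns about.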

Topics

Data Science & Analytics > Methods > Time Series Analysis
Machine Learning > Learning Types > Evaluation
Deep Learning > Learning Types > Deep Learning
Machine Learning > Learning Types > Anomaly Detection
Machine Learning > Core Methods > Evaluation

Keywords

time series analysis, benchmark evaluation, anomaly detection, time series anomaly detection, benchmark dataset, evaluation protocol, model performance, point adjustment, detection performance, untrained model, time-series anomaly detection
