TY - JOUR
T1 - TAB
T2 - 51st International Conference on Very Large Data Bases, VLDB 2025
AU - Qiu, Xiangfei
AU - Li, Zhe
AU - Qiu, Wanghui
AU - Hu, Shiyan
AU - Zhou, Lekui
AU - Wu, Xingjian
AU - Li, Zhengyu
AU - Guo, Chenjuan
AU - Zhou, Aoying
AU - Sheng, Zhenli
AU - Hu, Jilin
AU - Jensen, Christian S.
AU - Yang, Bin
N1 - Publisher Copyright:
© 2025, VLDB Endowment. All rights reserved.
PY - 2025
Y1 - 2025
N2 - Time series anomaly detection (TSAD) plays an important role in many domains such as finance, transportation, and healthcare. With the ongoing instrumentation of reality, more time series data will be available, leading also to growing demands for TSAD. While many TSAD methods already exist, new and better methods are still desirable. However, effective progress hinges on the availability of reliable means of evaluating new methods and comparing them with existing methods. We address deficiencies in current evaluation procedures related to datasets and experimental settings and protocols. Specifically, we propose a new time series anomaly detection benchmark, called TAB. First, TAB encompasses 29 public multivariate datasets and 1,635 univariate time series from different domains to facilitate more comprehensive evaluations on diverse datasets. Second, TAB covers a variety of TSAD methods, including Non-learning, Machine learning, Deep learning, LLM-based, and Time-series pre-trained methods. Third, TAB features a unified and automated evaluation pipeline that enables fair and easy evaluation of TSAD methods. Finally, we employ TAB to evaluate existing TSAD methods and report on the outcomes, thereby offering a deeper insight into the performance of these methods.
AB - Time series anomaly detection (TSAD) plays an important role in many domains such as finance, transportation, and healthcare. With the ongoing instrumentation of reality, more time series data will be available, leading also to growing demands for TSAD. While many TSAD methods already exist, new and better methods are still desirable. However, effective progress hinges on the availability of reliable means of evaluating new methods and comparing them with existing methods. We address deficiencies in current evaluation procedures related to datasets and experimental settings and protocols. Specifically, we propose a new time series anomaly detection benchmark, called TAB. First, TAB encompasses 29 public multivariate datasets and 1,635 univariate time series from different domains to facilitate more comprehensive evaluations on diverse datasets. Second, TAB covers a variety of TSAD methods, including Non-learning, Machine learning, Deep learning, LLM-based, and Time-series pre-trained methods. Third, TAB features a unified and automated evaluation pipeline that enables fair and easy evaluation of TSAD methods. Finally, we employ TAB to evaluate existing TSAD methods and report on the outcomes, thereby offering a deeper insight into the performance of these methods.
UR - https://www.scopus.com/pages/publications/105014238580
U2 - 10.14778/3746405.3746407
DO - 10.14778/3746405.3746407
M3 - 会议文章
AN - SCOPUS:105014238580
SN - 2150-8097
VL - 18
SP - 2775
EP - 2789
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
Y2 - 1 September 2025 through 5 September 2025
ER -