TY - JOUR
T1 - Fault classification for high-dimensional data streams
T2 - A directional diagnostic framework based on multiple hypothesis testing
AU - Xiang, Dongdong
AU - Li, Wendong
AU - Tsung, Fugee
AU - Pu, Xiaolong
AU - Kang, Yicheng
N1 - Publisher Copyright:
© 2021 Wiley Periodicals LLC
PY - 2021/10
Y1 - 2021/10
N2 - In various modern statistical process control applications that involve high-dimensional data streams (HDDS), accurate fault diagnosis of out-of-control (OC) streams is becoming crucial. The existing diagnostic approaches either focus on moderate-dimensional processes or are unable to determine the shift direction accurately, especially when the signal-to-noise ratio is low. In this paper, we conduct a bold trial and consider the fault classification problem of the mean vector of HDDS where determining the shift direction of the OC streams is important to perform customized repairs. To this end, under the basic assumptions that the in-control data streams are normal with mean 0 and variance 1, and that the high-dimensional observations after the alarm are solely OC, the problem is formulated into a three-classification multiple testing framework, and an efficient data-driven diagnostic procedure is developed to minimize the expected number of false positives and to control the missed discovery rate at given level. The procedure is statistically optimal and computationally efficient, and improves the diagnostic effectiveness by considering directional information, which provides insights to guide further decisions. Both theoretical and numerical results reveal the superiority of the new method.
AB - In various modern statistical process control applications that involve high-dimensional data streams (HDDS), accurate fault diagnosis of out-of-control (OC) streams is becoming crucial. The existing diagnostic approaches either focus on moderate-dimensional processes or are unable to determine the shift direction accurately, especially when the signal-to-noise ratio is low. In this paper, we conduct a bold trial and consider the fault classification problem of the mean vector of HDDS where determining the shift direction of the OC streams is important to perform customized repairs. To this end, under the basic assumptions that the in-control data streams are normal with mean 0 and variance 1, and that the high-dimensional observations after the alarm are solely OC, the problem is formulated into a three-classification multiple testing framework, and an efficient data-driven diagnostic procedure is developed to minimize the expected number of false positives and to control the missed discovery rate at given level. The procedure is statistically optimal and computationally efficient, and improves the diagnostic effectiveness by considering directional information, which provides insights to guide further decisions. Both theoretical and numerical results reveal the superiority of the new method.
KW - data-driven
KW - directional isolation
KW - high-dimensional fault diagnosis
KW - multiple testing
KW - statistical process control
UR - https://www.scopus.com/pages/publications/85107819928
U2 - 10.1002/nav.22008
DO - 10.1002/nav.22008
M3 - 文章
AN - SCOPUS:85107819928
SN - 0894-069X
VL - 68
SP - 973
EP - 987
JO - Naval Research Logistics
JF - Naval Research Logistics
IS - 7
ER -