A unified framework for high-dimensional data stream analysis in fault diagnosis

  • Jianqing Shi
  • , Yicheng Kang
  • , Liqiang Pu*
  • , Dongdong Xiang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Modern industrial systems routinely generate data in high volume at high velocity. These high-dimensional data streams (HDDS) provide valuable information at granular levels to quality personnel during root cause investigation in cases of a system fault. The goal of fault analysis using HDDS is twofold: (1) identify abnormal data streams and (2) locate the change point when the processes become out of control. Existing research has largely focused on addressing the two issues separately. In this article, we propose a unified framework by formulating the problem as optimal control of hierarchical missed discovery rates in multiple classifications. Theoretically, we establish that our approach minimizes the number of false discoveries while controlling the missed discovery rates at desired levels. Numerically, we develop a computationally efficient algorithm for solving the optimization and demonstrate its superior performance over the existing methods. A data-driven version of the proposed approach is suggested as well. An application to a real data set in semiconductor manufacturing shows that our approach works well in practice.

Original languageEnglish
Pages (from-to)35-50
Number of pages16
JournalJournal of Quality Technology
Volume57
Issue number1
DOIs
StatePublished - 2025

Keywords

  • multiple testing
  • quality engineering
  • statistical process control

Fingerprint

Dive into the research topics of 'A unified framework for high-dimensional data stream analysis in fault diagnosis'. Together they form a unique fingerprint.

Cite this