TY - JOUR
T1 - Software defect prediction based on enhanced metaheuristic feature selection optimization and a hybrid deep neural network
AU - Zhu, Kun
AU - Ying, Shi
AU - Zhang, Nana
AU - Zhu, Dandan
N1 - Publisher Copyright:
© 2021
PY - 2021/10
Y1 - 2021/10
N2 - Software defect prediction aims to identify the potential defects of new software modules in advance by constructing an effective prediction model. However, the model performance is susceptible to irrelevant and redundant features. In addition, previous studies mainly use traditional data mining or machine learning techniques for defect prediction, the prediction performance is not superior enough. For the first issue, motivated by the idea of search based software engineering, we leverage the recently proposed whale optimization algorithm (WOA) and another complementary simulated annealing (SA) to construct an enhanced metaheuristic search based feature selection algorithm named EMWS, which can effectively select fewer but closely related representative features. For the second issue, we employ a hybrid deep neural network — convolutional neural network (CNN) and kernel extreme learning machine (KELM) to construct a unified defect prediction predictor called WSHCKE, which can further integrate the selected features into the abstract deep semantic features by CNN and boost the prediction performance by taking full advantage of the strong classification capacity of KELM. We conduct extensive experiments for feature selection or extraction and defect prediction across 20 widely-studied software projects on four evaluation indicators. Experimental results demonstrate the superiority of EMWS and WSHCKE.
AB - Software defect prediction aims to identify the potential defects of new software modules in advance by constructing an effective prediction model. However, the model performance is susceptible to irrelevant and redundant features. In addition, previous studies mainly use traditional data mining or machine learning techniques for defect prediction, the prediction performance is not superior enough. For the first issue, motivated by the idea of search based software engineering, we leverage the recently proposed whale optimization algorithm (WOA) and another complementary simulated annealing (SA) to construct an enhanced metaheuristic search based feature selection algorithm named EMWS, which can effectively select fewer but closely related representative features. For the second issue, we employ a hybrid deep neural network — convolutional neural network (CNN) and kernel extreme learning machine (KELM) to construct a unified defect prediction predictor called WSHCKE, which can further integrate the selected features into the abstract deep semantic features by CNN and boost the prediction performance by taking full advantage of the strong classification capacity of KELM. We conduct extensive experiments for feature selection or extraction and defect prediction across 20 widely-studied software projects on four evaluation indicators. Experimental results demonstrate the superiority of EMWS and WSHCKE.
KW - Convolutional neural network
KW - Kernel extreme learning machine
KW - Metaheuristic feature selection
KW - Software defect prediction
KW - Whale optimization algorithm
UR - https://www.scopus.com/pages/publications/85108275396
U2 - 10.1016/j.jss.2021.111026
DO - 10.1016/j.jss.2021.111026
M3 - 文章
AN - SCOPUS:85108275396
SN - 0164-1212
VL - 180
JO - Journal of Systems and Software
JF - Journal of Systems and Software
M1 - 111026
ER -