摘要
With the broader adoption of machine learning (ML) in security-critical fields, the requirements for the explainability of ML are also increasing. The explainability aims at helping people understand models’ internal working principles and decision basis, which adds their realibility. However, the research on understanding ML models, such as random forest (RF), is still in the infant stage. Considering the strict and standardized characteristics of formal methods and their wide application in the field of ML in recent years, this work leverages formal methods and logical reasoning to develop a machine learning interpretability method for explaining the prediction of RF. Specifically, the decision-making process of RF is encoded into first-order logic formula, and the proposed approach is centered around minimal unsatisfiable cores (MUC) and local interpretation of feature importance and counterfactual sample generation method are provided. Experimental results on several public datasets illustrate the high quality of the proposed feature importance measurement, and the counterfactual sample generation method outperforms the state-of-the-art method. Moreover, from the perspective of user friendliness, the user report can be generated according to the analysis results of counterfactual samples, which can provide suggestions for users to improve their own situation in real-life applications.
| 投稿的翻译标题 | Minimal-unsatisfiable-core-driven Local Explainability Analysis for Random Forest |
|---|---|
| 源语言 | 繁体中文 |
| 页(从-至) | 2447-2463 |
| 页数 | 17 |
| 期刊 | Ruan Jian Xue Bao/Journal of Software |
| 卷 | 33 |
| 期 | 7 |
| DOI | |
| 出版状态 | 已出版 - 7月 2022 |
关键词
- counterfactual sample
- explainable machine learning
- feature importance
- formal method
- logical reasoning
指纹
探究 '基于最小不满足核的随机森林局部解释性分析' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver