Explaining GBDT by Probabilistic Finite-State Automata

  • Yinkai Chen
  • , Rui Zhang
  • , Xin Qiu
  • , Xin Li*
  • , Yuxin Deng
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Explainable artificial intelligence becomes vital for human users to understand and trust the decision-making process and results of machine learning methods. Unfortunately, most machine learning models are black-box and the algorithms running behind are opaque. In this work, we propose an approach to interpreting GBDT (Gradient Boosting Decision Tree Explanation) by extracting probabilistic finite-state automata from the trained model. Our method is inspired by and built upon a previous work that extracts probabilistic automata from RNN (Recurrent Neural Networks). To adapt the approach to our situation, we propose a series of techniques to ensure that the extracted probabilistic automaton approximates the GBDT model as accurately as possible. We conduct experiments on real-world datasets and our experimental results show that our method maintains a high level of fidelity of the extracted model as the size of the given GBDT model grows.

Original languageEnglish
Title of host publicationICCPR 2021 - Proceedings of 2021 10th International Conference on Computing and Pattern Recognition
PublisherAssociation for Computing Machinery
Pages328-333
Number of pages6
ISBN (Electronic)9781450390439
DOIs
StatePublished - 15 Oct 2021
Event10th International Conference on Computing and Pattern Recognition, ICCPR 2021 - Virtual, Online, China
Duration: 15 Oct 202117 Oct 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference10th International Conference on Computing and Pattern Recognition, ICCPR 2021
Country/TerritoryChina
CityVirtual, Online
Period15/10/2117/10/21

Keywords

  • Gradient Boosting Decision Tree
  • Interpretable machine learning
  • Probabilistic finite-state automata

Fingerprint

Dive into the research topics of 'Explaining GBDT by Probabilistic Finite-State Automata'. Together they form a unique fingerprint.

Cite this