A comparative analysis of methods for probability estimation tree

  • Na Chu*
  • Lizhuang Ma
  • Ping Liu
  • Yiyang Hu
  • Min Zhou
  • *Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

In this paper, we address the problem of class probability estimation with decision trees. This problem has received considerable attention in machine learning and data mining, and several techniques for using tree models as probability estimators have been proposed. We present a comparative study of six well-known class probability estimation methods, evaluated by classification accuracy, AUC, and conditional log likelihood (CLL). Our comments on the properties of each method are supported empirically. Experiments on UCI data sets and on our liver disease data sets show that probability estimation tree (PET) algorithms significantly outperform traditional decision trees and naïve Bayes in classification accuracy, AUC, and CLL. Finally, we give unifying pseudocode that summarizes the algorithms.
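
The three evaluation measures named in the abstract can be illustrated on toy data. The following is a minimal Python sketch, not the authors' code. It assumes CLL is the sum of the log probabilities a model assigns to each instance's true class, and it computes AUC for the binary case only, as the fraction of positive/negative pairs ranked correctly. The function names and the toy predictions are hypothetical.

```python
import math

def accuracy(probs, labels):
    """Fraction of instances whose most probable class equals the true label."""
    hits = 0
    for p, y in zip(probs, labels):
        predicted = max(range(len(p)), key=p.__getitem__)
        hits += predicted == y
    return hits / len(labels)

def binary_auc(scores, labels):
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win. Assumes labels are 0/1 and both classes occur.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def conditional_log_likelihood(probs, labels, eps=1e-12):
    """Sum of log P(true class | x); higher (closer to 0) is better.

    eps guards against log(0) when a model assigns zero probability.
    """
    return sum(math.log(max(p[y], eps)) for p, y in zip(probs, labels))

# Hypothetical class-probability estimates for four instances, two classes.
probs = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.3, 0.7]]
labels = [0, 1, 1, 1]

acc = accuracy(probs, labels)
auc = binary_auc([p[1] for p in probs], labels)
cll = conditional_log_likelihood(probs, labels)
print(acc, auc, cll)
```

Accuracy only looks at the top-ranked class, AUC only at the ordering of the scores, and CLL at the probability values themselves, which is why the three measures can disagree about the same estimator.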

Original language: English
Pages (from-to): 71-80
Number of pages: 10
Journal: WSEAS Transactions on Computers
Volume: 10
Issue number: 3
State: Published - Mar 2011
Externally published: Yes

Keywords

  • AUC
  • Classification
  • Conditional log likelihood
  • Decision trees
  • Joint distribution
  • Probability estimation tree
