Entropy-based model-free feature screening for ultrahigh-dimensional multiclass classification

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Most feature screening methods for ultrahigh-dimensional classification explicitly or implicitly assume the covariates are continuous. However, in the practice, it is quite common that both categorical and continuous covariates appear in the data, and applicable feature screening method is very limited. To handle this non-trivial situation, we propose an entropy-based feature screening method, which is model free and provides a unified screening procedure for both categorical and continuous covariates. We establish the sure screening and ranking consistency properties of the proposed procedure. We investigate the finite sample performance of the proposed procedure by simulation studies and illustrate the method by a real data analysis.

Original languageEnglish
Pages (from-to)515-530
Number of pages16
JournalJournal of Nonparametric Statistics
Volume28
Issue number3
DOIs
StatePublished - 2 Jul 2016

Keywords

  • entropy
  • feature screening
  • information gain
  • multiclass classification
  • sure screening property

Fingerprint

Dive into the research topics of 'Entropy-based model-free feature screening for ultrahigh-dimensional multiclass classification'. Together they form a unique fingerprint.

Cite this