The influences of number and nature of classifiers on consensus feature selection

Kyriacos Chrysostomou, Sherry Y. Chen, Xiaohui Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Wrapper feature selection approaches are widely used to select a small subset of relevant features from a dataset. However, Wrappers suffer from the fact that they only use a single classifier when selecting the features. The downside to this approach is that each classifier will have its own biases and will therefore select very different features. In order to overcome the biases of individual classifiers, we propose Consensus Feature Selection (CFS), which combines different classifiers for feature selection. In this way, selecting classifiers for use in the combinations is very important. Therefore, we investigate how the number and nature of classifiers influence the number of features selected and the classification accuracies that these features generate. In terms of number of classifiers, results showed that few selected more relevant features whereas many selected few features. In addition, 3-classifier combinations selected features that led to highest accuracies. In terms of nature of classifiers, decision trees identified most number of features whereas Bayesian classifiers identified least number of features. However, features selected by Bayesian classifiers led to accuracies higher than the other classifiers.

Original languageEnglish
Title of host publicationProceedings of the 2008 International Conference on Data Mining, DMIN 2008
EditorsR. Stahlbock, S.F. Crone, S. Lessmann
Pages173-179
Number of pages7
StatePublished - 2008
Event2008 International Conference on Data Mining, DMIN 2008 - Las Vegas, NV, United States
Duration: 14 Jul 200817 Jul 2008

Publication series

NameProceedings of the 2008 International Conference on Data Mining, DMIN 2008

Conference

Conference2008 International Conference on Data Mining, DMIN 2008
Country/TerritoryUnited States
CityLas Vegas, NV
Period14/07/0817/07/08

Keywords

  • Bayesian network
  • Consensus
  • Decision tree
  • Feature selection
  • Wrappers

Fingerprint

Dive into the research topics of 'The influences of number and nature of classifiers on consensus feature selection'. Together they form a unique fingerprint.

Cite this