Random relevant and non-redundant feature subspaces for co-training

Yusuf Yaslan*, Zehra Cataltepe

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Random feature subspace selection can produce diverse classifiers and help with Co-training as shown by RASCO algorithm of Wang et al. 2008. For data sets with many irrelevant or noisy feature, RASCO may end up with inaccurate classifiers. In order to remedy this problem, we introduce two algorithms for selecting relevant and non-redundant feature subspaces for Co-training. The first algorithm Rel-RASCO (Relevant Random Subspaces for Co-training) produces subspaces by drawing features with probabilities proportional to their relevances. We also modify a successful feature selection algorithm, mRMR (Minimum Redundancy Maximum Relevance), for random feature subset selection and introduce Prob-mRMR (Probabilistic-mRMR). Experiments on 5 datasets demonstrate that the proposed algorithms outperform both RASCO and Co-training in terms of accuracy achieved at the end of Co-training. Theoretical analysis of the proposed algorithms is also provided.

Original languageEnglish
Title of host publicationIntelligent Data Engineering and Automated Learning - IDEAL 2009 - 10th International Conference, Proceedings
Pages679-686
Number of pages8
DOIs
Publication statusPublished - 2009
Event10th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2009 - Burgos, Spain
Duration: 23 Sept 200926 Sept 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5788 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2009
Country/TerritorySpain
CityBurgos
Period23/09/0926/09/09

Keywords

  • Co-training
  • MRMR
  • RASCO
  • Random Subspace Methods

Fingerprint

Dive into the research topics of 'Random relevant and non-redundant feature subspaces for co-training'. Together they form a unique fingerprint.

Cite this