GPU-accelerated feature selection for outlier detection using the local kernel density ratio

Fatemeh Azmandian*, Ayse Yilmazer, Jennifer G. Dy, Javed A. Aslam, David R. Kaeli

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

29 Citations (Scopus)

Abstract

Effective outlier detection requires the data to be described by features that capture the behavior of normal data while emphasizing those characteristics of outliers which make them different than normal data. In this work, we present a novel non-parametric evaluation criterion for filter-based feature selection which caters to outlier detection problems. The proposed method seeks the subset of features that represent the inherent characteristics of the normal dataset while forcing outliers to stand out, making them more easily distinguished by outlier detection algorithms. Experimental results on real datasets show the advantage of our feature selection algorithm compared to popular and state-of-the-art methods. We also show that the proposed algorithm is able to overcome the small sample space problem and perform well on highly imbalanced datasets. Furthermore, due to the highly parallelizable nature of the feature selection, we implement the algorithm on a graphics processing unit (GPU) to gain significant speedup over the serial version. The benefits of the GPU implementation are two-fold, as its performance scales very well in terms of the number of features, as well as the number of data points.

Original languageEnglish
Title of host publicationProceedings - 12th IEEE International Conference on Data Mining, ICDM 2012
Pages51-60
Number of pages10
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event12th IEEE International Conference on Data Mining, ICDM 2012 - Brussels, Belgium
Duration: 10 Dec 201213 Dec 2012

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference12th IEEE International Conference on Data Mining, ICDM 2012
Country/TerritoryBelgium
CityBrussels
Period10/12/1213/12/12

Keywords

  • Feature selection
  • GPU acceleration
  • Imbalanced data
  • Outlier detection

Fingerprint

Dive into the research topics of 'GPU-accelerated feature selection for outlier detection using the local kernel density ratio'. Together they form a unique fingerprint.

Cite this