Statistical bias and variance of gene selection and cross validation methods: A case study on hypertension prediction

Zeliha Gormez*, Olcay Kursun, Ahmet Sertbas, Nizamettin Aydin, Huseyin Seker

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

In exploratory association studies of genes with certain diseases, a single or a small number of genes (features) related with the diseases are selected 1 among many thousands investigated. We investigate the statistical bias and variance of simple yet common (correlation and mutual information based) feature selection algorithms using well-known cross-validation methods (leave-one-out and k-fold) on a gene finding study for hypertension prediction. Our findings show that selected genes are different for different methods and different cross-validation runs for both single gene selection and gene subset selection.

Original languageEnglish
Title of host publicationProceedings - IEEE-EMBS International Conference on Biomedical and Health Informatics
Subtitle of host publicationGlobal Grand Challenge of Health Informatics, BHI 2012
Pages616-619
Number of pages4
DOIs
Publication statusPublished - 2012
Externally publishedYes
EventIEEE-EMBS International Conference on Biomedical and Health Informatics, BHI 2012. In Conj. with the 8th Int. Symp.on Medical Devices and Biosensors and the 7th Int. Symp. on Biomedical and Health Engineering - Hong Kong and Shenzhen, China
Duration: 2 Jan 20127 Jan 2012

Publication series

NameProceedings - IEEE-EMBS International Conference on Biomedical and Health Informatics: Global Grand Challenge of Health Informatics, BHI 2012

Conference

ConferenceIEEE-EMBS International Conference on Biomedical and Health Informatics, BHI 2012. In Conj. with the 8th Int. Symp.on Medical Devices and Biosensors and the 7th Int. Symp. on Biomedical and Health Engineering
Country/TerritoryChina
CityHong Kong and Shenzhen
Period2/01/127/01/12

Fingerprint

Dive into the research topics of 'Statistical bias and variance of gene selection and cross validation methods: A case study on hypertension prediction'. Together they form a unique fingerprint.

Cite this