The Fuzzy Gene Filter: A classifier performance assessment

Meir Perez, Tshilidzi Marwala

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

The Fuzzy Gene Filter (FGF) is an optimised Fuzzy Inference System designed to rank genes in order of differential expression, based on expression data generated in a microarray experiment. This paper examines the effectiveness of the FGF for feature selection using various classification architectures. The FGF is compared to three of the most common gene ranking algorithms: the t-test, the Wilcoxon test and ROC curve analysis. Four classification schemes are used to compare the performance of the FGF with that of the standard approaches: K-Nearest Neighbour (KNN), Support Vector Machine (SVM), Naïve Bayesian Classifier (NBC) and Artificial Neural Network (ANN). A nested stratified Leave-One-Out Cross Validation scheme is used to identify the optimal number of top-ranking genes, as well as the optimal classifier parameters. Two microarray data sets are used for the comparison: a prostate cancer data set and a lymphoma data set. Genes ranked by the FGF attained significantly higher accuracies for all of the classifiers tested, on both data sets (p = 0.0231 for the prostate data set and p = 0.1888 for the lymphoma data set). On the prostate data set, the FGF performed best with the KNN classifier, achieving an accuracy of 96.1% with the 9 top-ranking genes. On the lymphoma data set, the FGF performed best with the SVM classifier, achieving an accuracy of 100% with the 12 top-ranking genes. The performance of the FGF is attributed to the fact that it is optimised to rank genes in a way that maximises class separability, and to its incorporation of multiple features of the data when ranking genes.
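
The evaluation protocol described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and it does not implement the FGF itself: it ranks genes with a Welch t-statistic (one of the baseline rankers named above), classifies with a simple KNN, and uses a nested Leave-One-Out Cross Validation loop to choose the number of top-ranking genes. All function names, the toy data generator and the parameter values are assumptions made for illustration, and stratification is not modelled.

```python
import math
import random
from statistics import mean, variance


def t_scores(X, y):
    """Absolute Welch t-statistic per gene (column); classes are labelled 0 and 1."""
    scores = []
    for g in range(len(X[0])):
        a = [row[g] for row, lab in zip(X, y) if lab == 0]
        b = [row[g] for row, lab in zip(X, y) if lab == 1]
        se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
        scores.append(abs(mean(a) - mean(b)) / se if se > 0 else 0.0)
    return scores


def top_genes(X, y, k):
    """Indices of the k genes with the largest absolute t-statistic."""
    scores = t_scores(X, y)
    return sorted(range(len(scores)), key=lambda g: scores[g], reverse=True)[:k]


def knn_predict(train_X, train_y, x, genes, n_neighbours=3):
    """Majority vote among the nearest training samples, using only the selected genes."""
    dists = sorted(
        (sum((row[g] - x[g]) ** 2 for g in genes), lab)
        for row, lab in zip(train_X, train_y)
    )
    votes = [lab for _, lab in dists[:n_neighbours]]
    return 1 if sum(votes) * 2 > len(votes) else 0


def loocv_accuracy(X, y, k, n_neighbours=3):
    """Leave-one-out accuracy; genes are re-ranked on each training fold only."""
    correct = 0
    for i in range(len(X)):
        train_X, train_y = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        genes = top_genes(train_X, train_y, k)
        correct += knn_predict(train_X, train_y, X[i], genes, n_neighbours) == y[i]
    return correct / len(X)


def nested_loocv_accuracy(X, y, candidate_ks, n_neighbours=3):
    """Outer loop estimates accuracy; inner LOOCV picks the number of genes."""
    correct = 0
    for i in range(len(X)):
        train_X, train_y = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        best_k = max(
            candidate_ks,
            key=lambda k: loocv_accuracy(train_X, train_y, k, n_neighbours),
        )
        genes = top_genes(train_X, train_y, best_k)
        correct += knn_predict(train_X, train_y, X[i], genes, n_neighbours) == y[i]
    return correct / len(X)


def make_toy_data(n_per_class=10, n_genes=30, n_informative=3, seed=0):
    """Synthetic expression matrix (assumption): the first genes are shifted in class 1."""
    rng = random.Random(seed)
    X, y = [], []
    for lab in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
            for g in range(n_informative):
                row[g] += 5.0 * lab
            X.append(row)
            y.append(lab)
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    print("top genes:", top_genes(X, y, 3))
    print("nested LOOCV accuracy:", nested_loocv_accuracy(X, y, [1, 3, 5]))
```

Ranking inside each training fold, rather than once on the full data set, keeps the held-out sample from influencing which genes are selected. A different ranker could be substituted by replacing `t_scores` with any function that returns one score per gene.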

Original language: English
Title of host publication: Proceedings of the 2nd IASTED International Conference on Computational Bioscience, CompBio 2011
Pages: 406-413
Number of pages: 8
Publication status: Published - 2011
Event: 2nd International Conference on Computational Bioscience, CompBio 2011 - Cambridge, United Kingdom
Duration: 11 Jul 2011 to 13 Jul 2011

Publication series

Name: Proceedings of the 2nd IASTED International Conference on Computational Bioscience, CompBio 2011

Conference

Conference: 2nd International Conference on Computational Bioscience, CompBio 2011
Country/Territory: United Kingdom
City: Cambridge
Period: 11/07/11 to 13/07/11

Keywords

  • Classifier
  • Feature selection
  • Fuzzy Gene Filter
  • Microarray

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Theoretical Computer Science
