An Alzheimer's disease related genes identification method based on multiple classifier integration

Yu Miao, Huiyan Jiang, Huiling Liu, Yu-Dong Yao

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

Background and Objective: Alzheimer's disease (AD) is a fatal neurodegenerative disease with an insidious onset, and the set of AD-related genes (ADGs) is not yet fully understood. The National Center for Biotechnology Information (NCBI) provides an AD dataset of 22,283 genes, of which 71 have been identified as ADGs. The remaining 22,212 genes may still contain ADGs that have not yet been identified. This paper aims to identify additional ADGs using machine learning techniques.

Methods: To improve the accuracy of ADG identification, we propose a gene identification method based on multiple classifier integration. First, a feature selection algorithm is applied to select the most relevant attributes. Second, a two-stage cascading classifier is developed to identify ADGs. The first-stage classification is based on a relevance vector machine; in the second stage, the results of three classifiers (support vector machine, random forest and extreme learning machine) are combined through voting.

Results: According to our results, feature selection improves accuracy and reduces training time, and the voting-based classifier reduces classification errors. The proposed ADG identification system achieves accuracy, sensitivity and specificity of 78.77%, 83.10% and 74.67%, respectively. Using the proposed method, additional potential ADGs are identified, and the top 13 predicted ADGs are presented.

Conclusions: This paper presents an ADG identification method that combines feature selection, a cascading classifier and majority voting. The method leads to higher specificity and significantly increases the accuracy and sensitivity of ADG identification. Potentially new ADGs are identified.
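As an illustration only, the two-stage cascade with majority voting and the three reported metrics can be sketched in plain Python. This is not the authors' implementation. The abstract does not say how the first stage feeds the second, so the sketch assumes the first stage acts as a filter whose rejections are final. The names `majority_vote`, `cascade_predict` and `evaluate` are hypothetical, and the toy threshold rules merely stand in for the relevance vector machine, support vector machine, random forest and extreme learning machine.

```python
from typing import Callable, Dict, List, Sequence

Sample = Sequence[float]
# A classifier maps a feature vector to 1 (predicted ADG) or 0 (predicted non-ADG).
Classifier = Callable[[Sample], int]


def majority_vote(votes: Sequence[int]) -> int:
    """Return 1 when strictly more than half of the binary votes are 1."""
    return int(2 * sum(votes) > len(votes))


def cascade_predict(sample: Sample, stage_one: Classifier,
                    stage_two: Sequence[Classifier]) -> int:
    """Two-stage cascade.

    Assumption (not stated in the abstract): a sample rejected by stage one
    is labelled non-ADG immediately; only accepted samples reach the vote.
    """
    if stage_one(sample) == 0:
        return 0
    return majority_vote([clf(sample) for clf in stage_two])


def evaluate(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity (TP rate) and specificity (TN rate).

    Both classes must be present in y_true, otherwise a ratio is undefined.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy stand-ins for the trained models (hypothetical threshold rules).
stage_one: Classifier = lambda x: int(x[0] > 0.2)
stage_two: List[Classifier] = [
    lambda x: int(x[0] > 0.5),
    lambda x: int(x[1] > 0.5),
    lambda x: int(x[0] + x[1] > 1.0),
]

# Made-up feature vectors and labels, for demonstration only.
samples = [(0.9, 0.8), (0.1, 0.9), (0.6, 0.3), (0.3, 0.2)]
labels = [1, 1, 0, 0]

predictions = [cascade_predict(s, stage_one, stage_two) for s in samples]
metrics = evaluate(labels, predictions)
```

With three binary voters a tie cannot occur, which is one practical reason to combine an odd number of classifiers in the second stage.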

Original language: English
Pages (from-to): 107-115
Number of pages: 9
Journal: Computer Methods and Programs in Biomedicine
Volume: 150
State: Published - Oct 2017

Keywords

  • Alzheimer's disease
  • Cascading classifier
  • Feature selection
  • Gene identification
  • Majority voting
