A robust ensemble classification method analysis

Zhang, Zhongwei and Li, Jiuyong and Hu, Hong and Zhou, Hong (2010) A robust ensemble classification method analysis. In: 2009 International Conference on Bioinformatics and Computational Biology , 13-16 Jul 2009, Las Vegas, NV. United States.

Metadata

HTML CitationEndNoteDublin CoreReference Manager

Full text available as:

[img]
Preview
PDF (Accepted Version - Chapter 17) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
57Kb

Official URL: http://www.springer.com/life+sciences/bioinformatics/book/978-1-4419-5912-6

Identification Number or DOI: doi: 10.1007/978-1-4419-4913-3_17

Abstract

Apart from the dimensionality problem, the uncertainty of Microarray data quality is another major challenge of Microarray classification. Microarray data contains various levels of noise and quite often are high levels of noise, and these data lead to unreliable and low accuracy analysis as well as the high dimensionality problem. In this paper, we propose a new Microarray data classification method, based on diversified multiple trees. The new method contains features that, (1) make most use of the information from the abundant genes in the Microarray data, and (2) use a unique diversity measurement in the ensemble decision committee. The experimental results show that the proposed classification method (DMDT) and the well known method (CS4), which diversifies trees by using distinct tree roots, are more accurate on average than other well-known ensemble methods, including Bagging, Boosting and Random Forests. The experiments also indicate that using diversity measurement of DMDT improves the classification accuracy of ensemble classification on Microarray data.

Item Type:Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)
Additional Information:Chapter 17. Accepted version deposited with blanket permission of publisher. Print copy held USQ Library 570.285 Adv.
Uncontrolled Keywords:microarray gene data; classification method; ensemble decision tree; diversity; accuracy
Fields of Research (FOR2008):01 Mathematical Sciences > 0103 Numerical and Computational Mathematics > 010399 Numerical and Computational Mathematics not elsewhere classified
06 Biological Sciences > 0604 Genetics > 060405 Gene Expression (incl. Microarray and other genome-wide approaches)
06 Biological Sciences > 0603 Evolutionary Biology > 060399 Evolutionary Biology not elsewhere classified
Subjects:UNSPECIFIED
Socio-Economic Objective (SEO2008):E Expanding Knowledge > 97 Expanding Knowledge > 970111 Expanding Knowledge in the Medical and Health Sciences
ID Code:8924
Deposited By:
Deposited On:07 Jun 2011 09:11
Last Modified:17 Feb 2012 15:03

Archive Staff Only: edit this record