A robust ensemble classification method analysis

Zhang, Zhongwei and Li, Jiuyong and Hu, Hong and Zhou, Hong (2010) A robust ensemble classification method analysis. In: 2009 International Conference on Bioinformatics and Computational Biology , 13-16 Jul 2009, Las Vegas, NV. United States.

[img]
Preview
PDF (Accepted Version - Chapter 17)
Zhang_Li_Hu_Zhou_AV.pdf

Download (57Kb)

Abstract

Apart from the dimensionality problem, the uncertainty of Microarray data quality is another major challenge of Microarray classification. Microarray data contains various levels of noise and quite often are high levels of noise, and these data lead to unreliable and low accuracy analysis as well as the high dimensionality problem. In this paper, we propose a new Microarray data classification method, based on diversified multiple trees. The new method contains features that, (1) make most use of the information from the abundant genes in the Microarray data, and (2) use a unique diversity measurement in the ensemble decision committee. The experimental results show that the proposed classification method (DMDT) and the well known method (CS4), which diversifies trees by using distinct tree roots, are more accurate on average than other well-known ensemble methods, including Bagging, Boosting and Random Forests. The experiments also indicate that using diversity measurement of DMDT improves the classification accuracy of ensemble classification on Microarray data.


Statistics for USQ ePrint 8924
Statistics for this ePrint Item
Item Type: Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)
Refereed: Yes
Item Status: Live Archive
Additional Information: Chapter 17. Accepted version deposited with blanket permission of publisher. Print copy held USQ Library 570.285 Adv.
Depositing User: Dr Zhongwei Zhang
Faculty / Department / School: Historic - Faculty of Sciences - Department of Maths and Computing
Date Deposited: 06 Jun 2011 23:11
Last Modified: 03 Jul 2013 00:05
Uncontrolled Keywords: microarray gene data; classification method; ensemble decision tree; diversity; accuracy
Fields of Research (FOR2008): 01 Mathematical Sciences > 0103 Numerical and Computational Mathematics > 010399 Numerical and Computational Mathematics not elsewhere classified
06 Biological Sciences > 0604 Genetics > 060405 Gene Expression (incl. Microarray and other genome-wide approaches)
06 Biological Sciences > 0603 Evolutionary Biology > 060399 Evolutionary Biology not elsewhere classified
Socio-Economic Objective (SEO2008): E Expanding Knowledge > 97 Expanding Knowledge > 970111 Expanding Knowledge in the Medical and Health Sciences
Identification Number or DOI: doi: 10.1007/978-1-4419-4913-3_17
URI: http://eprints.usq.edu.au/id/eprint/8924

Actions (login required)

View Item Archive Repository Staff Only