Hu, Hong and Li, Jiuyong and Wang, Hua and Daggard, Grant (2006) Combined gene selection methods for microarray data analysis. In: 10th International Conference Knowledge-Based Intelligent Information and Engineering Systems, 9-11 Oct 2006, Bournemouth, UK.
[Abstract]: In recent years, the rapid development of DNA Microarray technology has made it possible for scientists to monitor the expression levels of thousands of genes in a single experiment. Data from this new technology presents fresh challenges to scientists, since it contains a large number of genes (on the order of tens of thousands) but only a small number of samples (on the order of hundreds). Both filter and wrapper gene selection methods aim to select the most informative genes from this mass of data in order to reduce the size of the expression database. Gene selection methods are used in both the data preprocessing and classification stages. We conducted experiments using several existing gene selection methods to preprocess Microarray data for classification by the benchmark algorithms SVMs and C4.5. The study suggests that combining filter and wrapper methods generally improves the accuracy of gene expression Microarray data classification. The study also indicates that not all filter gene selection methods help improve classification performance. The experimental results show that, among the tested gene selection methods, Correlation Coefficient is the best for improving classification accuracy with both the SVM and C4.5 classification algorithms.
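As an illustration of what a filter-style gene selection step can look like, the following is a minimal sketch, not the authors' implementation. It assumes the score is the absolute Pearson correlation between each gene's expression values and a binary class label; the paper's exact Correlation Coefficient definition is not given in this record and may differ. The function names `pearson` and `rank_genes` and the toy data are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant gene (or a single class) carries no signal
    return cov / math.sqrt(var_x * var_y)


def rank_genes(expr, labels, k):
    """Return the indices of the k genes most correlated with the labels.

    expr   -- list of samples, each a list of expression values (one per gene)
    labels -- list of 0/1 class labels, one per sample (assumed binary)
    """
    n_genes = len(expr[0])
    scores = []
    for g in range(n_genes):
        column = [sample[g] for sample in expr]
        scores.append((abs(pearson(column, labels)), g))
    # Highest score first; ties broken by gene index for determinism.
    scores.sort(key=lambda item: (-item[0], item[1]))
    return [g for _, g in scores[:k]]


# Hypothetical toy data: 4 samples x 3 genes, two classes.
expr = [
    [1.0, 5.0, 0.2],
    [1.2, 4.0, 0.9],
    [3.0, 5.1, 0.3],
    [3.1, 4.2, 0.8],
]
labels = [0, 0, 1, 1]
selected = rank_genes(expr, labels, k=1)
```

In a combined filter and wrapper pipeline, the reduced gene set returned by such a filter would then be handed to a classifier-driven (wrapper) selection step or directly to a classifier such as an SVM or C4.5; that stage is not shown here.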
|Item Type:||Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)|
|Additional Information:||Deposited in accordance with the copyright policy of the publisher. Copyright 2006 Springer. This is the authors' version of the work. It is posted here with permission of the publisher for your personal use. No further distribution is permitted. The item is also available in Lecture Notes in Computer Science v. 4251 at http://www.springerlink.com|
|Uncontrolled Keywords:||classification; gene selection; Microarray data|
|Subjects:||270000 Biological Sciences > 270800 Biotechnology > 270899 Biotechnology not elsewhere classified; 280000 Information, Computing and Communication Sciences|
|Depositing User:||Dr Jiuyong (John) Li|
|Date Deposited:||11 Oct 2007 00:57|
|Last Modified:||02 Jul 2013 22:42|