Robust rule-based prediction

Li, Jiuyong (2006) Robust rule-based prediction. IEEE Transactions on Knowledge and Data Engineering, 18 (8). pp. 1043-1054. ISSN 1041-4347

Official URL: http://dx.doi.org/10.1109/TKDE.2006.129

Identification Number or DOI: doi: 10.1109/TKDE.2006.129

Abstract

This paper studies the problem of robust rule-based classification, i.e., making predictions in the presence of missing values in data. This study differs from other research on missing values in that it does not treat the missing values themselves but builds a rule-based classification model that tolerates them. Based on a commonly used rule-based classification model, we characterise the robustness of a hierarchy of k-optimal rule sets, in which decreasing size corresponds to decreasing robustness. We build classifiers based on k-optimal rule sets and show experimentally that they are more robust than some benchmark rule-based classifiers, such as C4.5rules and CBA. We also show that the proposed approach is better than two well-known missing value handling methods for missing values in test data.

Item Type: Article (Commonwealth Reporting Category C)
Additional Information: Published version deposited in accordance with the copyright policy of the publisher. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. Copyright 2006 IEEE. Personal use of this material is permitted. This material is posted here with permission of the IEEE. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Uncontrolled Keywords: data mining, rule, classification, robustness
Fields of Research (FOR2008): 08 Information and Computing Sciences > 0801 Artificial Intelligence and Image Processing > 080109 Pattern Recognition and Data Mining
Subjects: 280000 Information, Computing and Communication Sciences > 280200 Artificial Intelligence and Signal and Image Processing > 280213 Other Artificial Intelligence
Socio-Economic Objective (SEO2008): UNSPECIFIED
ID Code: 2088
Deposited By:
Deposited On: 11 Oct 2007 10:57
Last Modified: 13 Dec 2011 09:12
