Effective pruning for the discovery of conditional functional dependencies

Li, Jiuyong and Liu, Jixue and Toivonen, Hannu and Yong, Jianming (2013) Effective pruning for the discovery of conditional functional dependencies. The Computer Journal, 56 (3). pp. 378-392. ISSN 0010-4620

Abstract

Conditional functional dependencies (CFDs) have been proposed as a new type of semantic rules extended from traditional functional dependencies. They have shown great potential for detecting and repairing inconsistent data. Constant CFDs are 100% confidence association rules. The theoretical search space for the minimal set of CFDs is the set of minimal generators and their closures in data. This search space has been used in the currently most efficient constant CFD discovery algorithm. In this paper, we propose pruning criteria to further prune the theoretic search space, and design a fast algorithm for constant CFD discovery. We evaluate the proposed algorithm on a number of media to large real-world data sets. The proposed algorithm is faster than the currently most efficient constant CFD discovery algorithm, and has linear time performance in the size of a data set.


Statistics for USQ ePrint 22121
Statistics for this ePrint Item
Item Type: Article (Commonwealth Reporting Category C)
Refereed: Yes
Item Status: Live Archive
Additional Information: Copyright of Computer Journal is the property of Oxford University Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. First published online 24 Jun 2012.
Depositing User: Dr Jianming Yong
Faculty / Department / School: Historic - Faculty of Business and Law - School of Information Systems
Date Deposited: 17 Oct 2012 03:03
Last Modified: 19 Aug 2014 05:01
Uncontrolled Keywords: functional dependencies; conditional functional dependencies; association rules; closed patterns
Fields of Research (FOR2008): 08 Information and Computing Sciences > 0806 Information Systems > 080610 Information Systems Organisation
08 Information and Computing Sciences > 0806 Information Systems > 080604 Database Management
01 Mathematical Sciences > 0101 Pure Mathematics > 010107 Mathematical Logic, Set Theory, Lattices and Universal Algebra
Socio-Economic Objective (SEO2008): E Expanding Knowledge > 97 Expanding Knowledge > 970108 Expanding Knowledge in the Information and Computing Sciences
Identification Number or DOI: doi: 10.1093/comjnl/bxs082
URI: http://eprints.usq.edu.au/id/eprint/22121

Actions (login required)

View Item Archive Repository Staff Only