Efficient systematic clustering method for k-anonymization

Kabir, Md Enamul and Wang, Hua and Bertino, Elisa (2011) Efficient systematic clustering method for k-anonymization. Acta Informatica, 48 (1). pp. 51-66. ISSN 0001-5903

Abstract

This paper presents a clustering (Clustering partitions record into clusters such that records within a cluster are similar to each other, while records in different clusters are most distinct from one another.) based k-anonymization technique to minimize the information loss while at the same time assuring data quality. Privacy preservation of individuals has drawn considerable interests in data mining research. The k-anonymity model proposed by Samarati and Sweeney is a practical approach for data privacy preservation and has been studied extensively for the last few years. Anonymization methods via generalization or suppression are able to protect private information, but lose valued information. The challenge is how to minimize the information loss during the anonymization process. We refer to the challenge as a systematic clustering problem for k-anonymization which is analysed in this paper. The proposed technique adopts group-similar data together and then anonymizes each group individually. The structure of systematic clustering problem is defined and investigated through paradigm and properties. An algorithm of the proposed problem is developed and shown that the time complexity is in O( n2 k ), where n is the total number of records containing individuals concerning their privacy. Experimental results show that our method attains a reasonable dominance with respect to both information loss and execution time. Finally the algorithm illustrates the usability for incremental datasets.


Statistics for USQ ePrint 18227
Statistics for this ePrint Item
Item Type: Article (Commonwealth Reporting Category C)
Refereed: Yes
Item Status: Live Archive
Additional Information: Permanent restricted access to published version due to publisher copyright restrictions.
Depositing User: Mr Md Enamul Kabir
Faculty / Department / School: Historic - Faculty of Sciences - Department of Maths and Computing
Date Deposited: 18 Feb 2011 23:06
Last Modified: 03 Jul 2013 00:27
Uncontrolled Keywords: k-anonymity; systematic clustering; privacy
Fields of Research (FOR2008): 08 Information and Computing Sciences > 0806 Information Systems > 080603 Conceptual Modelling
15 Commerce, Management, Tourism and Services > 1503 Business and Management > 150313 Quality Management
08 Information and Computing Sciences > 0803 Computer Software > 080303 Computer System Security
Socio-Economic Objective (SEO2008): B Economic Development > 89 Information and Communication Services > 8902 Computer Software and Services > 890202 Application Tools and System Utilities
Identification Number or DOI: doi: 10.1007/s00236-010-0131-6
URI: http://eprints.usq.edu.au/id/eprint/18227

Actions (login required)

View Item Archive Repository Staff Only