A genetic algorithm based technique for outlier detection with fast convergence

Zhu, Xiaodong and Zhang, Ji and Hu, Zewen and Li, Hongzhou and Chang, Liang and Zhu, Youwen and Lin, Jerry Chun-Wei and Qin, Yongrui (2018) A genetic algorithm based technique for outlier detection with fast convergence. In: 14th International Conference on Advanced Data Mining and Applications (ADMA 2018), 16-18 Nov 2018, Nanjing, China.


Abstract

In this paper, we study the problem of subspace outlier detection in high dimensional data space and propose a new genetic algorithm-based technique to identify outliers embedded in subspaces. The existing technique, mainly using genetic algorithm (GA) to carry out the subspace search, is generally slow due to its expensive fitness evaluation and long solution encoding scheme. In this paper, we propose a novel technique to improve the performance of the existing GA-based outlier detection method using a bit freezing approach to achieve a faster convergence. Through freezing converged bits in the solution encoding strings, this innovative approach can contribute to fast crossover and mutation operations and achieve an early stop of the GA that leads to more accurate approximation of fitness function. This research work can contribute to the development of a more efficient search method for detecting subspace outliers. The experimental results demonstrate the improved efficiency of our technique compared with the existing method.


Statistics for USQ ePrint 36149
Statistics for this ePrint Item
Item Type: Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)
Refereed: Yes
Item Status: Live Archive
Additional Information: © Springer Nature Switzerland AG 2018.
Faculty/School / Institute/Centre: Historic - Faculty of Health, Engineering and Sciences - School of Agricultural, Computational and Environmental Sciences (1 Jul 2013 - 5 Sep 2019)
Faculty/School / Institute/Centre: Historic - Faculty of Health, Engineering and Sciences - School of Agricultural, Computational and Environmental Sciences (1 Jul 2013 - 5 Sep 2019)
Date Deposited: 23 Jul 2020 05:37
Last Modified: 24 Sep 2020 01:40
Uncontrolled Keywords: outlier detection; data stream; nominal or categorical data
Fields of Research (2008): 08 Information and Computing Sciences > 0801 Artificial Intelligence and Image Processing > 080109 Pattern Recognition and Data Mining
Identification Number or DOI: https://doi.org/10.1007/978-3-030-05090-0_8
URI: http://eprints.usq.edu.au/id/eprint/36149

Actions (login required)

View Item Archive Repository Staff Only