Detecting projected outliers in high-dimensional data streams

Zhang, Ji and Gao, Qigang and Wang, Hai and Liu, Qing and Xu, Kai (2009) Detecting projected outliers in high-dimensional data streams. In: DEXA 2009: 20th International Conference on Database and Expert Systems Applications, 31 Aug- 4Sep 2009, Linz, Austria.

PDF (Accepted Version)

Download (279Kb)


In this paper, we study the problem of projected outlier detection in high dimensional data streams and propose a new technique, called Stream Projected Ouliter deTector (SPOT), to identify outliers embedded in subspaces. Sparse Subspace Template (SST), a set of subspaces obtained by unsupervised and/or supervised learning processes, is constructed in SPOT to detect projected outliers effectively. Multi-Objective Genetic Algorithm (MOGA) is employed as an effective search method for finding outlying subspaces from training data to construct SST. SST is able to carry out online self-evolution in the detection stage to cope with dynamics of data streams. The experimental results demonstrate the efficiency and effectiveness of SPOT in detecting outliers in high-dimensional data streams.

Statistics for USQ ePrint 5620
Statistics for this ePrint Item
Item Type: Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)
Refereed: Yes
Item Status: Live Archive
Additional Information (displayed to public): Author's version deposited in accordance with the copyright policy of the publisher. The original publication is available at
Depositing User: Dr Ji Zhang
Faculty / Department / School: Historic - Faculty of Sciences - Department of Maths and Computing
Date Deposited: 09 Sep 2009 23:53
Last Modified: 02 Jul 2013 23:23
Uncontrolled Keywords: stream projected outlier deTector; SPOT; outlier detection; atmospheric temperature; clustering algorithms; data communication systems; database systems; detectors
Fields of Research (FoR): 08 Information and Computing Sciences > 0801 Artificial Intelligence and Image Processing > 080109 Pattern Recognition and Data Mining
08 Information and Computing Sciences > 0806 Information Systems > 080604 Database Management
08 Information and Computing Sciences > 0802 Computation Theory and Mathematics > 080201 Analysis of Algorithms and Complexity
Socio-Economic Objective (SEO): E Expanding Knowledge > 97 Expanding Knowledge > 970108 Expanding Knowledge in the Information and Computing Sciences
Identification Number or DOI: doi: 10.1007/978-3-642-03573-9_53

Actions (login required)

View Item Archive Repository Staff Only