A two-stage information filtering based on rough decision rule and pattern mining

Zhou, Xujuan and Li, Yuefeng and Bruza, Peter and Xu, Yue and Lau, Raymond (2010) A two-stage information filtering based on rough decision rule and pattern mining. Journal of Emerging Technologies in Web Intelligence, 2 (4). pp. 326-332. ISSN 1798-0461


Information Overload and Mismatch are two fundamental problems affecting the effectiveness of information filtering systems. Even though both term-based and patternbased approaches have been proposed to address the problems of overload and mismatch, neither of these approaches alone can provide a satisfactory solution to address these problems. This paper presents a novel two-stage information filtering model which combines the merits of term-based and pattern-based approaches to effectively filter sheer volume of information. In particular, the first filtering stage is supported by a novel rough analysis model which efficiently removes a large number of irrelevant documents, thereby addressing the overload problem. The second filtering stage is empowered by a semantically rich pattern taxonomy mining model which effectively fetches incoming documents according to the specific information needs of a user, thereby addressing the mismatch problem. The experimental results based on the RCV1 corpus show that the proposed twostage filtering model significantly outperforms the both termbased and pattern-based information filtering models.

Statistics for USQ ePrint 30979
Statistics for this ePrint Item
Item Type: Article (Commonwealth Reporting Category C)
Refereed: Yes
Item Status: Live Archive
Additional Information: Files associated with this item cannot be displayed due to copyright restrictions.
Faculty/School / Institute/Centre: No Faculty
Faculty/School / Institute/Centre: No Faculty
Date Deposited: 30 May 2017 05:26
Last Modified: 01 Nov 2017 02:24
Uncontrolled Keywords: information filtering; user profiles; rough set theory; pattern mining
Fields of Research (2008): 08 Information and Computing Sciences > 0806 Information Systems > 080699 Information Systems not elsewhere classified
Fields of Research (2020): 46 INFORMATION AND COMPUTING SCIENCES > 4609 Information systems > 460999 Information systems not elsewhere classified
Identification Number or DOI: https://doi.org/10.4304/jetwi.2.4.326-332
URI: http://eprints.usq.edu.au/id/eprint/30979

Actions (login required)

View Item Archive Repository Staff Only