Extending Graph Pattern Matching with Regular Expressions

Wang, Xin and Wang, Yang and Xu, Yang and Zhang, Ji and Zhong, Xueyan (2020) Extending Graph Pattern Matching with Regular Expressions. In: 31st International Conference on Database and Expert Systems Applications (DEXA 2020), 14–17 Sept 2020, Bratislava, Slovakia.


Abstract

Graph pattern matching, which is to compute the set M(Q, G) of matches of Q in G, for the given pattern graph Q and data graph G, has been increasingly used in emerging applications e.g., social network analysis. As the matching semantic is typically defined in terms of subgraph isomorphism, two key issues are hence raised: the semantic is often too rigid to identify meaningful matches, and the problem is intractable, which calls for efficient matching methods. Motivated by these, this paper extends matching semantic with regular expressions, and investigates the top-k graph pattern matching problem. (1) We introduce regular patterns, which revise traditional pattern graphs by incorporating regular expressions; extend traditional matching semantic by allowing edge to regular path mapping. With the extension, more meaningful matches could be captured. (2) We propose a relevance function, that is defined in terms of tightness of connectivity, for ranking matches. Based on the ranking function, we introduce the top-k graph pattern matching problem, denoted by TopK. (3) We show that TopK is intractable. Despite hardness, we develop an algorithm with early termination property, i.e., it finds top-k matches without identifying entire match set. (4) Using real-life and synthetic data, we experimentally verify that our top-k matching algorithms are effective, and outperform traditional counterparts.


Statistics for USQ ePrint 41396
Statistics for this ePrint Item
Item Type: Conference or Workshop Item (Commonwealth Reporting Category E) (Paper)
Refereed: Yes
Item Status: Live Archive
Faculty/School / Institute/Centre: Current - Faculty of Health, Engineering and Sciences - School of Sciences (6 Sep 2019 -)
Faculty/School / Institute/Centre: Current - Faculty of Health, Engineering and Sciences - School of Sciences (6 Sep 2019 -)
Date Deposited: 22 Feb 2021 03:48
Last Modified: 26 Feb 2021 04:17
Uncontrolled Keywords: Early termination; Emerging applications; Graph pattern matching; Matching methods; Ranking functions; Regular expressions; Regular patterns; Subgraph isomorphism
Fields of Research (2008): 08 Information and Computing Sciences > 0804 Data Format > 080403 Data Structures
Fields of Research (2020): 46 INFORMATION AND COMPUTING SCIENCES > 4605 Data management and data science > 460503 Data models, storage and indexing
Identification Number or DOI: https://doi.org/10.1007/978-3-030-59051-2_8
URI: http://eprints.usq.edu.au/id/eprint/41396

Actions (login required)

View Item Archive Repository Staff Only