Abstract
Subgraph matching, as a challenging problem in the graph area, has a wide range of applications from social networks to computational biology. It refers to finding the occurrences of a query graph in a given large graph. Subgraph matching is considered an NP-complete problem in the literature. In order to reduce the computational complexity of subgraph matching, pruning the search space with heuristics, e.g., topological features, have been studied broadly. In this paper, we propose a novel pruning strategy in subgraph matching to reduce the size of search space, named Eigen Decomposition Pruning. EDP uses local spectral features to prune the search space instead of directly using topological features. Firstly, it generates a Local Laplacian Matrix (LLM) for each candidate solution. Then, it prunes the false positive candidates from the search space using the spectral features of these LLMs. An LLM is defined to capture the features in the localities of a graph and imposes a low computation overhead. Additionally, EDP recognizes more false positive candidates than previous methods due to more informative spectral features of LLM, which results in a noticeable decrease in the overall time of subgraph matching. To assess the performance of our algorithm, it is applied to both real and synthetic datasets and is compared against four well-known methods in this field. The theoretical analysis and experimental results confirm the effectiveness of the proposed method.








Similar content being viewed by others
Availability of data and materials
Not applicable.
References
Pavlopoulos GA, Secrier M, Moschopoulos CN, Soldatos TG, Kossida S, Aerts J, Schneider R, Bagos PG (2011) Using graph theory to analyze biological networks. Biodata Min 4:10
Viégas FB, Donath J, (2004) Social network visualization: can we go beyond the graph. In: Workshop on Social Networks, CSCW, pp 6–10
Ma T, Yu S, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2018) A comparative study of subgraph matching isomorphic methods in social networks. IEEE Access 6:66621–66631
Kijima S, Otachi Y, Saitoh T, Uno T (2012) Subgraph isomorphism in graph classes. Discret Math 312:3164–3173
Konagaya M, Otachi Y, Uehara R (2016) Polynomial-time algorithms for Subgraph Isomorphism in small graph classes of perfect graphs. Discret Appl Math 199:37–45
Kowaluk M, Lingas A (2018) Are unique subgraphs not easier to find? Inf Process Lett 134:57–61
Lin X, Zhang R, Wen Z, Wang H, Qi J (2014) Efficient subgraph matching using gpus. Australasian database conference. Springer, Cham, pp 74–85
Bouhenni S, Yahiaoui S, Nouali-Taboudjemat N, Kheddouci H (2022) Efficient parallel edge-centric approach for relaxed graph pattern matching. J Supercomput 78:1642–1671
Fehér P, Asztalos M, Vajk T, Mészáros T, Lengyel L (2017) Detecting subgraph isomorphism with MapReduce. J Supercomput 73:1810–1851
Sun S, Sun X, Che Y, Luo Q, He B (2020) Rapidmatch: a holistic approach to subgraph query processing. Proc VLDB Endow 14:176–188
Ullmann JR (1976) An algorithm for subgraph isomorphism. J ACM 23:31–42
Kim H, Choi Y, Park K, Lin X, Hong S-H, Han W-S, (2021) Versatile equivalences: speeding up subgraph query processing and subgraph matching. In: Proceedings of the 2021 International Conference on Management of Data, pp 925–937
He H, Singh AK (2008) Graphs-at-a-time: query language and access methods for graph databases. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, ACM, pp 405–418
Lin Z, Bei Y (2014) Graph indexing for large networks: a neighborhood tree-based approach. Knowl Based Syst 72:48–59
Zhao P, Han J (2010) On graph query optimization in large networks. Proc VLDB Endow 3:340–351
Han W-S, Lee J, Lee J-H (2013) Turbo ISO: towards ultrafast and robust subgraph isomorphism search in large graph databases. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, ACM, pp 337–348
Carletti V, Foggia P, Saggese A, Vento M (2017) Challenging the time complexity of exact subgraph isomorphism for huge and dense graphs with VF3. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2017.2696940
Carletti V, Foggia P, Vento M (2015) VF2 Plus: an improved version of VF2 for biological graphs. International Workshop on Graph-Based Representations in Pattern Recognition. Springer, Cham, pp 168–177
Cordella LP, Foggia P, Sansone C, Vento M (2004) A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans Pattern Anal Mach Intell 26:1367–1372
Dahm N, Bunke H, Caelli T, Gao Y (2015) Efficient subgraph matching using topological node feature constraints. Pattern Recogn 48:317–330
Jüttner A, Madarasi P (2018) VF2++—An improved subgraph isomorphism algorithm. Discret Appl Math 242:69–81
Lee C-H, Chung C-W (2014) Efficient search in graph databases using cross filtering. Inf Sci 286:1–18
Lian X, Chen L, Wang G (2016) Quality-aware subgraph matching over inconsistent probabilistic graph databases. IEEE Trans Knowl Data Eng 28:1560–1574
Solnon C (2010) Alldifferent-based filtering for subgraph isomorphism. Artif Intell 174:850–864
Sun Y, Wang W, Wu N, Liu C, Bhatia S, Yu Y, Yu W (2022) AAAN: Anomaly Alignment in Attributed Networks. Knowl Based Syst 249:108944
Liu L, Du B, Tong H (2019) G-finder: approximate attributed subgraph matching. In: 2019 IEEE International Conference on Big Data, IEEE, pp 513–522
Zeng L, Zou L, Özsu MT, Hu L, Zhang F (2020) GSI: GPU-friendly subgraph isomorphism. In: 2020 IEEE 36th International Conference on Data Engineering (ICDE), IEEE, pp 1249–1260
Lan Z, Yu L, Yuan L, Wu Z, Niu Q, Ma F (2021) Sub-gmn: the subgraph matching network model. arXiv preprint arXiv:210400186
Sun S, Luo Q (2020) Subgraph matching with effective matching order and indexing. IEEE Trans Knowl Data Eng 34:491–505
Moorman JD, Tu TK, Chen Q, He X, Bertozzi AL (2021) Subgraph matching on multiplex networks. IEEE Trans Netw Sci Eng 8:1367–1384
Li F, Zou Z (2021) Subgraph matching on temporal graphs. Inf Sci 578:539–558
Sun Y, Li G, Du J, Ning B, Chen H (2022) A subgraph matching algorithm based on subgraph index for knowledge graph. Front Comput Sci 16:1–18
Kim H, Choi Y, Park K, Lin X, Hong S-H, Han W-S (2022) Fast subgraph query processing and subgraph matching via static and dynamic equivalences. VLDB J. https://doi.org/10.1007/s00778-022-00749-x
Micale G, Bonnici V, Ferro A, Shasha D, Giugno R, Pulvirenti A (2020) Multiri: fast subgraph matching in labeled multigraphs. arXiv preprint arXiv:200311546
Mawhirter D, Reinehr S, Holmes C, Liu T, Wu B (2021) Graphzero: a high-performance subgraph matching system. ACM SIGOPS Oper Syst Rev 55:21–37
Arai J, Onizuka M, Fujiwara Y, Iwamura S (2020) Fast subgraph matching by exploiting search failures. arXiv preprint arXiv:201214420
Anderson WN Jr, Morley TD (1985) Eigenvalues of the laplacian of a graph∗. Linear Multilinear Algebra 18:141–145
Harville DA (1998) Matrix algebra from a statistician’s perspective. Taylor & Francis, New York
Zhan C, Chen G, Yeung LF (2010) On the distributions of laplacian eigenvalues versus node degrees in complex networks. Phys A 389:1779–1788
Cozzo E, de Arruda GF, Rodrigues FA, Moreno Y (2016) Multilayer networks: metrics and spectral properties. Interconnected networks. Springer, Cham, pp 17–35
Mahdi G, Chakraborty A, Arnold ME, Rebelo AG (2019) Efficient Bayesian modeling of large lattice data using spectral properties of laplacian matrix. Spat Stat. https://doi.org/10.1016/j.spasta.2019.01.003
Fiori M, Sapiro G (2015) On spectral properties for graph matching and graph isomorphism problems. Inf Inference J IMA 4:63–76
Raviv D, Kimmel R, Bruckstein AM (2013) Graph isomorphisms and automorphisms via spectral signatures. IEEE Trans Pattern Anal Mach Intell 35:1985–1993
Chakrabarti D, Zhan Y, Faloutsos C R-MAT: A recursive model for graph mining. In: Proceedings of the 2004 SIAM International Conference on Data Mining, 2004. SIAM, pp 442–446
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
H. Moayed developed the theoretical formalism, performed the analytic calculations and performed the numerical simulations. E. Mansoori supervised the findings of this work. All authors discussed the results and contributed to the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Moayed, H., Mansoori, E.G. & Moosavi, M.R. An efficient pruning method for subgraph matching in large-scale graphs. J Supercomput 79, 10511–10532 (2023). https://doi.org/10.1007/s11227-023-05061-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-023-05061-1