Skip to main content

Advertisement

Log in

An efficient pruning method for subgraph matching in large-scale graphs

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Subgraph matching, as a challenging problem in the graph area, has a wide range of applications from social networks to computational biology. It refers to finding the occurrences of a query graph in a given large graph. Subgraph matching is considered an NP-complete problem in the literature. In order to reduce the computational complexity of subgraph matching, pruning the search space with heuristics, e.g., topological features, have been studied broadly. In this paper, we propose a novel pruning strategy in subgraph matching to reduce the size of search space, named Eigen Decomposition Pruning. EDP uses local spectral features to prune the search space instead of directly using topological features. Firstly, it generates a Local Laplacian Matrix (LLM) for each candidate solution. Then, it prunes the false positive candidates from the search space using the spectral features of these LLMs. An LLM is defined to capture the features in the localities of a graph and imposes a low computation overhead. Additionally, EDP recognizes more false positive candidates than previous methods due to more informative spectral features of LLM, which results in a noticeable decrease in the overall time of subgraph matching. To assess the performance of our algorithm, it is applied to both real and synthetic datasets and is compared against four well-known methods in this field. The theoretical analysis and experimental results confirm the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Availability of data and materials

Not applicable.

References

  1. Pavlopoulos GA, Secrier M, Moschopoulos CN, Soldatos TG, Kossida S, Aerts J, Schneider R, Bagos PG (2011) Using graph theory to analyze biological networks. Biodata Min 4:10

    Article  Google Scholar 

  2. Viégas FB, Donath J, (2004) Social network visualization: can we go beyond the graph. In: Workshop on Social Networks, CSCW, pp 6–10

  3. Ma T, Yu S, Cao J, Tian Y, Al-Dhelaan A, Al-Rodhaan M (2018) A comparative study of subgraph matching isomorphic methods in social networks. IEEE Access 6:66621–66631

    Article  Google Scholar 

  4. Kijima S, Otachi Y, Saitoh T, Uno T (2012) Subgraph isomorphism in graph classes. Discret Math 312:3164–3173

    Article  MathSciNet  MATH  Google Scholar 

  5. Konagaya M, Otachi Y, Uehara R (2016) Polynomial-time algorithms for Subgraph Isomorphism in small graph classes of perfect graphs. Discret Appl Math 199:37–45

    Article  MathSciNet  MATH  Google Scholar 

  6. Kowaluk M, Lingas A (2018) Are unique subgraphs not easier to find? Inf Process Lett 134:57–61

    Article  MathSciNet  MATH  Google Scholar 

  7. Lin X, Zhang R, Wen Z, Wang H, Qi J (2014) Efficient subgraph matching using gpus. Australasian database conference. Springer, Cham, pp 74–85

    Google Scholar 

  8. Bouhenni S, Yahiaoui S, Nouali-Taboudjemat N, Kheddouci H (2022) Efficient parallel edge-centric approach for relaxed graph pattern matching. J Supercomput 78:1642–1671

    Article  Google Scholar 

  9. Fehér P, Asztalos M, Vajk T, Mészáros T, Lengyel L (2017) Detecting subgraph isomorphism with MapReduce. J Supercomput 73:1810–1851

    Article  Google Scholar 

  10. Sun S, Sun X, Che Y, Luo Q, He B (2020) Rapidmatch: a holistic approach to subgraph query processing. Proc VLDB Endow 14:176–188

    Article  Google Scholar 

  11. Ullmann JR (1976) An algorithm for subgraph isomorphism. J ACM 23:31–42

    Article  MathSciNet  Google Scholar 

  12. Kim H, Choi Y, Park K, Lin X, Hong S-H, Han W-S, (2021) Versatile equivalences: speeding up subgraph query processing and subgraph matching. In: Proceedings of the 2021 International Conference on Management of Data, pp 925–937

  13. He H, Singh AK (2008) Graphs-at-a-time: query language and access methods for graph databases. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, ACM, pp 405–418

  14. Lin Z, Bei Y (2014) Graph indexing for large networks: a neighborhood tree-based approach. Knowl Based Syst 72:48–59

    Article  Google Scholar 

  15. Zhao P, Han J (2010) On graph query optimization in large networks. Proc VLDB Endow 3:340–351

    Article  Google Scholar 

  16. Han W-S, Lee J, Lee J-H (2013) Turbo ISO: towards ultrafast and robust subgraph isomorphism search in large graph databases. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, ACM, pp 337–348

  17. Carletti V, Foggia P, Saggese A, Vento M (2017) Challenging the time complexity of exact subgraph isomorphism for huge and dense graphs with VF3. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2017.2696940

    Article  Google Scholar 

  18. Carletti V, Foggia P, Vento M (2015) VF2 Plus: an improved version of VF2 for biological graphs. International Workshop on Graph-Based Representations in Pattern Recognition. Springer, Cham, pp 168–177

    Google Scholar 

  19. Cordella LP, Foggia P, Sansone C, Vento M (2004) A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans Pattern Anal Mach Intell 26:1367–1372

    Article  Google Scholar 

  20. Dahm N, Bunke H, Caelli T, Gao Y (2015) Efficient subgraph matching using topological node feature constraints. Pattern Recogn 48:317–330

    Article  MATH  Google Scholar 

  21. Jüttner A, Madarasi P (2018) VF2++—An improved subgraph isomorphism algorithm. Discret Appl Math 242:69–81

    Article  MathSciNet  MATH  Google Scholar 

  22. Lee C-H, Chung C-W (2014) Efficient search in graph databases using cross filtering. Inf Sci 286:1–18

    Article  MathSciNet  MATH  Google Scholar 

  23. Lian X, Chen L, Wang G (2016) Quality-aware subgraph matching over inconsistent probabilistic graph databases. IEEE Trans Knowl Data Eng 28:1560–1574

    Article  Google Scholar 

  24. Solnon C (2010) Alldifferent-based filtering for subgraph isomorphism. Artif Intell 174:850–864

    Article  MathSciNet  MATH  Google Scholar 

  25. Sun Y, Wang W, Wu N, Liu C, Bhatia S, Yu Y, Yu W (2022) AAAN: Anomaly Alignment in Attributed Networks. Knowl Based Syst 249:108944

    Article  Google Scholar 

  26. Liu L, Du B, Tong H (2019) G-finder: approximate attributed subgraph matching. In: 2019 IEEE International Conference on Big Data, IEEE, pp 513–522

  27. Zeng L, Zou L, Özsu MT, Hu L, Zhang F (2020) GSI: GPU-friendly subgraph isomorphism. In: 2020 IEEE 36th International Conference on Data Engineering (ICDE), IEEE, pp 1249–1260

  28. Lan Z, Yu L, Yuan L, Wu Z, Niu Q, Ma F (2021) Sub-gmn: the subgraph matching network model. arXiv preprint arXiv:210400186

  29. Sun S, Luo Q (2020) Subgraph matching with effective matching order and indexing. IEEE Trans Knowl Data Eng 34:491–505

    Article  Google Scholar 

  30. Moorman JD, Tu TK, Chen Q, He X, Bertozzi AL (2021) Subgraph matching on multiplex networks. IEEE Trans Netw Sci Eng 8:1367–1384

    Article  MathSciNet  Google Scholar 

  31. Li F, Zou Z (2021) Subgraph matching on temporal graphs. Inf Sci 578:539–558

    Article  MathSciNet  Google Scholar 

  32. Sun Y, Li G, Du J, Ning B, Chen H (2022) A subgraph matching algorithm based on subgraph index for knowledge graph. Front Comput Sci 16:1–18

    Article  Google Scholar 

  33. Kim H, Choi Y, Park K, Lin X, Hong S-H, Han W-S (2022) Fast subgraph query processing and subgraph matching via static and dynamic equivalences. VLDB J. https://doi.org/10.1007/s00778-022-00749-x

    Article  Google Scholar 

  34. Micale G, Bonnici V, Ferro A, Shasha D, Giugno R, Pulvirenti A (2020) Multiri: fast subgraph matching in labeled multigraphs. arXiv preprint arXiv:200311546

  35. Mawhirter D, Reinehr S, Holmes C, Liu T, Wu B (2021) Graphzero: a high-performance subgraph matching system. ACM SIGOPS Oper Syst Rev 55:21–37

    Article  Google Scholar 

  36. Arai J, Onizuka M, Fujiwara Y, Iwamura S (2020) Fast subgraph matching by exploiting search failures. arXiv preprint arXiv:201214420

  37. Anderson WN Jr, Morley TD (1985) Eigenvalues of the laplacian of a graph∗. Linear Multilinear Algebra 18:141–145

    Article  MathSciNet  MATH  Google Scholar 

  38. Harville DA (1998) Matrix algebra from a statistician’s perspective. Taylor & Francis, New York

    MATH  Google Scholar 

  39. Zhan C, Chen G, Yeung LF (2010) On the distributions of laplacian eigenvalues versus node degrees in complex networks. Phys A 389:1779–1788

    Article  Google Scholar 

  40. Cozzo E, de Arruda GF, Rodrigues FA, Moreno Y (2016) Multilayer networks: metrics and spectral properties. Interconnected networks. Springer, Cham, pp 17–35

    Chapter  MATH  Google Scholar 

  41. Mahdi G, Chakraborty A, Arnold ME, Rebelo AG (2019) Efficient Bayesian modeling of large lattice data using spectral properties of laplacian matrix. Spat Stat. https://doi.org/10.1016/j.spasta.2019.01.003

    Article  MathSciNet  Google Scholar 

  42. Fiori M, Sapiro G (2015) On spectral properties for graph matching and graph isomorphism problems. Inf Inference J IMA 4:63–76

    Article  MathSciNet  MATH  Google Scholar 

  43. Raviv D, Kimmel R, Bruckstein AM (2013) Graph isomorphisms and automorphisms via spectral signatures. IEEE Trans Pattern Anal Mach Intell 35:1985–1993

    Article  Google Scholar 

  44. Chakrabarti D, Zhan Y, Faloutsos C R-MAT: A recursive model for graph mining. In: Proceedings of the 2004 SIAM International Conference on Data Mining, 2004. SIAM, pp 442–446

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

H. Moayed developed the theoretical formalism, performed the analytic calculations and performed the numerical simulations. E. Mansoori supervised the findings of this work. All authors discussed the results and contributed to the final manuscript.

Corresponding author

Correspondence to Hojjat Moayed.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Moayed, H., Mansoori, E.G. & Moosavi, M.R. An efficient pruning method for subgraph matching in large-scale graphs. J Supercomput 79, 10511–10532 (2023). https://doi.org/10.1007/s11227-023-05061-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-023-05061-1

Keywords

Navigation

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy