Abstract
With the expansion of application scale, the dependence of microservices becomes more and more complex, which makes it difficult to accurately locate the root cause of failure. At present, many microservices fault diagnosis methods have the problems of high cost of fine-grained location and difficult to predict the fault propagation path, which leads to more false positives and low diagnosis efficiency. Therefore, this paper proposes a microservice fault diagnosis method based on correlation analysis. Firstly, the historical exception and fault data of microservices is collected and processed, and the exception event is defined. Secondly, the machine learning algorithm is used to mine the correlation between microservices and abnormal events, and the hierarchical correlation graph is constructed. Finally, a microservice fault diagnosis mechanism based on a hierarchical correlation graph is designed to infer the root cause of the fault and provide effective diagnosis information for administrators. Experimental results show that the mechanism can quickly locate the fault location and infer the root cause of microservice failure in a more fine-grained way.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Merkel, D.: Docker: lightweight linux containers for consistent development and deployment. Linux J. 2014(239), 76–90 (2014)
Romsaiyud, W., Premchaiswadi, W.: An adaptive machine learning on Map-Reduce framework for improving performance of large-scale data analysis on EC2. In: 2013 Eleventh International Conference on ICT and Knowledge Engineering, pp. 1–7. IEEE (2013)
Sigelman, B.H., Barroso, L.A., Burrows, M., et al.: Dapper, a large-scale distributed systems tracing infrastructure. Technical report, Google (2010)
Mallik, S., Bhadra, T., Mukherji, A.: DTFP-growth: dynamic threshold based FP-growth rule mining algorithm through integrating gene expression, methylation and protein-protein interaction profiles. IEEE Trans. NanoBiosci. 17(2), 117–125 (2018)
Zhou, X., Peng, X., et al.: Delta debugging microservice systems with parallel optimization. IEEE Trans. Serv. Comput. 15, 16–29 (2019)
Guo, A., Mao, X., Yang, D., et al.: An empirical study on the effect of dynamic slicing on automated program repair efficiency. In: 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 554–558. IEEE (2018)
Ma, S.P., Fan, C.Y., Chuang, Y., et al.: Using service dependency graph to analyze and test microservices. In: 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC), pp. 81–86. IEEE (2018)
Zhou, X., Peng, X., Xie, T., et al.: Latent error prediction and fault localization for microservice applications by learning from system trace logs. In: Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 683–694. ACM (2019)
Li, X., Li, W., Zhang, Y., et al.: DeepFL: integrating multiple fault diagnosis dimensions for deep fault localization. In: Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 169–180. ACM (2019)
Yu, X., Liu, J., Yang, Z., et al.: The Bayesian Network based program dependence graph and its application to fault localization. J. Syst. Softw. 134(1), 44–53 (2017)
Jia, T., Chen, P., Yang, L., et al.: An approach for anomaly diagnosis based on hybrid graph model with logs for distributed services. In: 2017 IEEE International Conference on Web Services (ICWS), pp. 25–32. IEEE (2017)
Biau, G.: Analysis of a random forests model. J. Mach. Learn. Res. 13(2), 1063–1095 (2010)
Xiao, J., Stolkin, R., Leonardis, A.: Single target tracking using adaptive clustered decision trees and dynamic multi-level appearance models. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4978–4987. IEEE (2015)
Feng, X., Li, S., Yuan, C., et al.: Prediction of slope stability using Naive Bayes Classifier. KSCE J. Civ. Eng. 22(3), 941–950 (2018)
Sockshop. https://github.com/microservices-demo/microservices-demo
Acknowledgements
This work is supported by the Natural Science Foundation of China (No. 61762008), the National Key Research and Development Project of China (No. 2018YFB1404404), the Major special project of science and technology of Guangxi (No. AA18118047-7), and the Guangxi Natural Science Foundation Project (No. 2017GXNSFAA198141).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tang, Y., Chen, N., Zhang, Y., Yao, X., Yu, S. (2022). Fine-Grained Diagnosis Method for Microservice Faults Based on Hierarchical Correlation Analysis. In: Sun, Y., et al. Computer Supported Cooperative Work and Social Computing. ChineseCSCW 2021. Communications in Computer and Information Science, vol 1491. Springer, Singapore. https://doi.org/10.1007/978-981-19-4546-5_2
Download citation
DOI: https://doi.org/10.1007/978-981-19-4546-5_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-4545-8
Online ISBN: 978-981-19-4546-5
eBook Packages: Computer ScienceComputer Science (R0)