Abstract
Computed tomography (CT) is an important technique that is widely used in disease screening and diagnosis. In order to assist doctors in diagnosis and treatment plans, an efficient and accurate automatic image segmentation technology is urgently needed. CT images of different lesions always have problems such as different resolutions, different numbers of lesions, and inconspicuous contrast between lesions and background areas, which brings considerable challenges to the automated segmentation process. To this end, we propose a dual-path self-attention multi-scale feature fusion network (DS-MSFF-Net) that fuses self-attention mechanism and dilated convolution. It is worth noting that this network includes two parallel branch paths, which enables it to extract long-range semantic feature information effectively while extracting detailed feature information of CT images. Additionally, a novel feature extraction module is designed to focus limited learning resources on low-resolution high-order semantic feature maps, which can improve the segmentation accuracy without significant additional computational overhead. We extensively evaluate our method on the LIDC-IDRI lung nodule segmentation dataset and the LiTS2017 liver segmentation dataset, which outperforms other recent state-of-the-art methods on various CT image segmentation tasks.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability and access
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Jun Y, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced netvlad with weighted triplet loss for place recognition. IEEE Trans Neural Netw Learn Syst 31(2):661–674
Liu F, Song Q, Jin G (2020) The classification and denoising of image noise based on deep neural networks. Appl Intell 50(7):2194–2207
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, pp 234–241. Springer
Kirillov A, Wu Y, He K, Girshick R (2020) Pointrend: Image segmentation as rendering. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9799–9808
Zhang Y, Bai Y, Ding M, Shibiao X, Ghanem B (2020) Kgsnet: key-point-guided super-resolution network for pedestrian detection in the wild. IEEE Trans Neural Netw Learn Syst 32(5):2251–2265
Pal SK, Pramanik A, Maiti J, Mitra P (2021) Deep learning in multi-object detection and tracking: state of the art. Appl Intell 51(9):6400–6429
Shin Y-G, Sagong M-C, Yeo Y-J, Kim S-W, Ko S-J (2020) Pepsi++: Fast and lightweight network for image inpainting. IEEE Trans Neural Netw Learn Syst 32(1):252–265
Chen Y, Zhang H, Liu L, Chen X, Zhang Q, Yang K, Xia R, Xie J (2021) Research on image inpainting algorithm of improved gan based on two-discriminations networks. Appl Intell 51(6):3460–3474
Hong C, Jun Y, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Industr Inform 15(7):3952–3961
Zhongxu H, Youmin H, Bo W, Liu J, Han D, Kurfess T (2018) Hand pose estimation with multi-scale network. Appl Intell 48(8):2501–2515
Li X, Zhou Y, Peng D, Lang G, Min X, Wei W (2021) A deep learning system that generates quantitative ct reports for diagnosing pulmonary tuberculosis. Appl Intell 51(6):4082–4093
Xia H, Ma M, Li H, Song S (2021) Mc-net: multi-scale context-attention network for medical ct image segmentation. Appl Intell 1–12
Liao F, Liang M, Li Z, Xiaolin H, Song S (2019) Evaluate the malignancy of pulmonary nodules using the 3-d deep leaky noisy-or network. IEEE Trans Neural Netw Learn Syst 30(11):3484–3495
Roy R, Chakraborti T, Chowdhury AS (2019) A deep learning-shape driven level set synergism for pulmonary nodule segmentation. Pattern Recognit Lett 123:31–38
Shakibapour E, Cunha A, Aresta G, Mendonça AM, Campilho A (2019) An unsupervised metaheuristic search approach for segmentation and volume measurement of pulmonary nodules in lung ct scans. Exp Syst Appl 119:415–428
Zhu W, Liu C, Fan W, Xie X (2018) Deeplung: Deep 3d dual path nets for automated pulmonary nodule detection and classification. In: 2018 IEEE Winter conference on applications of computer vision (WACV), pp 673–681. IEEE
Wei Y, Liang X, Chen Y, Shen X, Cheng M-M, Feng J, Zhao Y, Yan S (2016) Stc: A simple to complex framework for weakly-supervised semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39(11):2314–2320
Li M, Hsu W, Xie X, Cong J, Gao W (2020) Sacnn: Self-attention convolutional neural network for low-dose ct denoising with self-supervised perceptual loss network. IEEE Trans Med Imaging 39(7):2289–2301
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Christ PF, Ettlinger F, Grün F, Elshaera MEA, Lipkova J, Schlecht S, Ahmaddy F, Tatavarty S, Bickel M, Bilic P et al (2017) Automatic liver and tumor segmentation of ct and mri volumes using cascaded fully convolutional neural networks. arXiv:1702.05970
Bi L, Feng D, Kim J (2018) Dual-path adversarial learning for fully convolutional network (fcn)-based medical image segmentation. Vis Comput 34(6):1043–1052
Wu Y, Lin L (2020) Automatic lung segmentation in ct images using dilated convolution based weighted fully convolutional network. In: Journal of Physics: Conference Series, vol 1646, pp 012032. IOP Publishing
Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2017) Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848
Men K, Dai J, Li Y (2017) Automatic segmentation of the clinical target volume and organs at risk in the planning ct for rectal cancer using deep dilated convolutional neural networks. Med Phys 44(12):6377–6389
Xia H, Sun W, Song S, Mou X (2020) Md-net: multi-scale dilated convolution network for ct images segmentation. Neural Process Lett 51(3):2915–2927
Liu Z, Liu X, Xiao B, Wang S, Miao Z, Sun Y, Zhang F (2020) Segmentation of organs-at-risk in cervical cancer ct images with a convolutional neural network. Phys Med 69:184–191
Zhao X, Zhang P, Song F, Fan G, Sun Y, Wang Y, Tian Z, Zhang L, Zhang G (2021) D2a u-net: Automatic segmentation of covid-19 ct slices based on dual attention and hybrid dilated convolution. Comput Biol Med 104526
Zhou Z, Siddiquee Md MR, Tajbakhsh N, Liang J (2018) Unet++: A nested u-net architecture for medical image segmentation. In: Deep learning in medical image analysis and multimodal learning for clinical decision support, pp 3–11. Springer
Li X, Chen H, Qi X, Dou Q, Chi-Wing F, Heng P-A (2018) H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes. IEEE Trans Med Imaging 37(12):2663–2674
He K, Lian C, Zhang B, Zhang X, Cao X, Nie D, Gao Y, Zhang J, Shen D (2021) Hf-unet: learning hierarchically inter-task relevance in multi-task u-net for accurate prostate segmentation in ct images. IEEE Trans Med Imaging 40(8):2118–2128
Khan RA, Luo Y, Wu F-X (2022) Rms-unet: Residual multi-scale unet for liver and lesion segmentation. Artif Intell Med 124:102231
Yin S, Deng H, Zelin X, Zhu Q, Cheng J (2022) Sd-unet: A novel segmentation framework for ct images of lung infections. Electronics 11(1):130
Mehta S, Rastegari M, Caspi A, Shapiro L, Hajishirzi H (2018) Espnet: Efficient spatial pyramid of dilated convolutions for semantic segmentation. In: Proceedings of the european conference on computer vision (ECCV), pp 552–568
Chen Y, Wang K, Liao X, Qian Y, Wang Q, Yuan Z, Heng P-A (2019) Channel-unet: a spatial channel-wise convolutional neural network for liver and tumors segmentation. Front Genet 10:1110
Kushnure DT, Talbar SN (2021) Ms-unet: A multi-scale unet with feature recalibration approach for automatic liver and tumor segmentation in ct images. Comput Med Imaging Graph 89:101885
Xie X, Pan X, Shao F, Zhang W, An J (2022) Mci-net: Multi-scale context integrated network for liver ct image segmentation. Comput Electric Eng 101:108085
Xia H, Ma M, Li H, Song S (2022) Mc-net: multi-scale context-attention network for medical ct image segmentation. Appl Intell 52(2):1508–1519
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Srinivas A, Lin T-Y, Parmar N, Shlens J, Abbeel P, Vaswani A (2021) Bottleneck transformers for visual recognition. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 16519–16529
Sinha A, Dolz J (2020) Multi-scale self-guided attention for medical image segmentation. IEEE J Biomed Health Inform 25(1):121–130
Zhang C, Jingben L, Hua Q, Li C, Wang P (2022) Saa-net: U-shaped network with scale-axis-attention for liver tumor segmentation. Biomed Signal Process Control 73:103460
Tang H, Liu X, Han K, Xie X, Chen X, Qian H, Liu Y, Sun S, Bai N (2021) Spatial context-aware self-attention model for multi-organ segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 939–949
Chen L-C, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Chen L-C, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Wang C-Y, Liao H-YM, Wu Y-H, Chen P-Y, Hsieh J-W, Yeh I-H (2020) Cspnet: A new backbone that can enhance learning capability of cnn. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 390–391
Cao Y, Xu J, Lin S, Wei F, Hu H (2019) Gcnet: Non-local networks meet squeeze-excitation networks and beyond. In: Proceedings of the IEEE/CVF international conference on computer vision workshops
Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q (2020) Eca-net: efficient channel attention for deep convolutional neural networks, 2020 IEEE. In: CVF conference on computer vision and pattern recognition (CVPR). IEEE
Maqsood M, Yasmin S, Mehmood I, Bukhari M, Kim M (2021) An efficient da-net architecture for lung nodule segmentation. Mathematics 9(13):1457
Shuo Wang M, Zhou ZL, Liu Z, Dongsheng G, Zang Y, Dong D, Gevaert O, Tian J (2017) Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation. Med Image Anal 40:172–183
Jiang J, Hu Y-C, Liu C-J, Halpenny D, Hellmann MD, Deasy JO, Mageras G, Veeraraghavan H (2018) Multiple resolution residually connected feature streams for automatic lung tumor segmentation from ct images. IEEE Tran Med Imaging 38(1):134–144
Huang X, Shan J, Vaidya V (2017) Lung nodule detection in ct using 3d convolutional neural networks. In: 2017 IEEE 14th international symposium on biomedical imaging (ISBI 2017), pp 379–383. IEEE
Zhao X, Sun W, Qian W, Qi S, Sun J, Zhang B, Yang Z (2019) Fine-grained lung nodule segmentation with pyramid deconvolutional neural network. In Medical Imaging 2019: Computer-Aided Diagnosis, vol 10950, pp 956–961. SPIE
Hancock MC, Magnan JF (2019) Lung nodule segmentation via level set machine learning. arXiv:1910.03191
Huang X, Sun W, Tseng T-LB, Li C, Qian W (2019) Fast and fully-automated detection and segmentation of pulmonary nodules in thoracic ct scans using deep convolutional neural networks. Comput Med Imaging Graph 74:25–36
Wang J, Lv P, Wang H, Shi C (2021) Sar-u-net: squeeze-and-excitation block and atrous spatial pyramid pooling based residual u-net for automatic liver segmentation in computed tomography. Comput Methods Programs Biomed 208:106268
Oktay O, Schlemper J, Folgoc LL, Lee M, Heinrich M, Misawa K, Mori K, McDonagh S, Hammerla NY, Kainz B et al (2018) Attention u-net: Learning where to look for the pancreas. arXiv:1804.03999
Changling L, Xiangfen S, Hang Z, Li F, Tao H, Yuchen Z, Jun J, Jianan W, Jianping X, Yong S (2021) An 8-layer residual u-net with deep supervision for segmentation of the left ventricle in cardiac ct angiography. Comput Methods Programs Biomed 200:105876
Acknowledgements
This work has been supported in part by the National Natural Science Foundation of China under Grant 62102331, 62176125 and 61772272, which also has been supported in part by Natural Science Foundation of Sichuan Province 2022NSFSC0839 and Southwest University of Science and Technology Doctoral Research Fund Project 22zx7110.
Author information
Authors and Affiliations
Contributions
Xiaoqian Zhang: Software, Writing-Reviewing and Editing. Lei Pu: Conceptualization, Methodology, Software. Liming Wan: Writing-Original draft preparation, Data curation, Software. Xiao Wang: Software, Editing. Ying Zhou: Data curation.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical and informed consent for data used
This article does not contain any studies with human participants or animals performed by the author. We obtain ethical and informed consent from data subjects before collecting, using, or disclosing their personal data.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, X., Pu, L., Wan, L. et al. DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation. Appl Intell 54, 4490–4506 (2024). https://doi.org/10.1007/s10489-024-05372-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05372-7