DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation

Zhang, Xiaoqian; Pu, Lei; Wan, Liming; Wang, Xiao; Zhou, Ying

doi:10.1007/s10489-024-05372-7

DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation

Published: 27 March 2024

Volume 54, pages 4490–4506, (2024)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Xiaoqian Zhang ORCID: orcid.org/0000-0002-4007-4501¹,
Lei Pu¹,
Liming Wan²,
Xiao Wang¹ &
…
Ying Zhou³

767 Accesses
1 Altmetric
Explore all metrics

Abstract

Computed tomography (CT) is an important technique that is widely used in disease screening and diagnosis. In order to assist doctors in diagnosis and treatment plans, an efficient and accurate automatic image segmentation technology is urgently needed. CT images of different lesions always have problems such as different resolutions, different numbers of lesions, and inconspicuous contrast between lesions and background areas, which brings considerable challenges to the automated segmentation process. To this end, we propose a dual-path self-attention multi-scale feature fusion network (DS-MSFF-Net) that fuses self-attention mechanism and dilated convolution. It is worth noting that this network includes two parallel branch paths, which enables it to extract long-range semantic feature information effectively while extracting detailed feature information of CT images. Additionally, a novel feature extraction module is designed to focus limited learning resources on low-resolution high-order semantic feature maps, which can improve the segmentation accuracy without significant additional computational overhead. We extensively evaluate our method on the LIDC-IDRI lung nodule segmentation dataset and the LiTS2017 liver segmentation dataset, which outperforms other recent state-of-the-art methods on various CT image segmentation tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Hybrid Multi-atrous and Multi-scale Network for Liver Lesion Detection

Multi-scale feature pyramid fusion network for medical image segmentation

Article 30 August 2022

CT-UFormer: an improved hybrid decoder for image segmentation

Article 05 December 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability and access

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Jun Y, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced netvlad with weighted triplet loss for place recognition. IEEE Trans Neural Netw Learn Syst 31(2):661–674
Google Scholar
Liu F, Song Q, Jin G (2020) The classification and denoising of image noise based on deep neural networks. Appl Intell 50(7):2194–2207
Article Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention, pp 234–241. Springer
Kirillov A, Wu Y, He K, Girshick R (2020) Pointrend: Image segmentation as rendering. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9799–9808
Zhang Y, Bai Y, Ding M, Shibiao X, Ghanem B (2020) Kgsnet: key-point-guided super-resolution network for pedestrian detection in the wild. IEEE Trans Neural Netw Learn Syst 32(5):2251–2265
Article Google Scholar
Pal SK, Pramanik A, Maiti J, Mitra P (2021) Deep learning in multi-object detection and tracking: state of the art. Appl Intell 51(9):6400–6429
Article Google Scholar
Shin Y-G, Sagong M-C, Yeo Y-J, Kim S-W, Ko S-J (2020) Pepsi++: Fast and lightweight network for image inpainting. IEEE Trans Neural Netw Learn Syst 32(1):252–265
Article Google Scholar
Chen Y, Zhang H, Liu L, Chen X, Zhang Q, Yang K, Xia R, Xie J (2021) Research on image inpainting algorithm of improved gan based on two-discriminations networks. Appl Intell 51(6):3460–3474
Article Google Scholar
Hong C, Jun Y, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Industr Inform 15(7):3952–3961
Article Google Scholar
Zhongxu H, Youmin H, Bo W, Liu J, Han D, Kurfess T (2018) Hand pose estimation with multi-scale network. Appl Intell 48(8):2501–2515
Article Google Scholar
Li X, Zhou Y, Peng D, Lang G, Min X, Wei W (2021) A deep learning system that generates quantitative ct reports for diagnosing pulmonary tuberculosis. Appl Intell 51(6):4082–4093
Article Google Scholar
Xia H, Ma M, Li H, Song S (2021) Mc-net: multi-scale context-attention network for medical ct image segmentation. Appl Intell 1–12
Liao F, Liang M, Li Z, Xiaolin H, Song S (2019) Evaluate the malignancy of pulmonary nodules using the 3-d deep leaky noisy-or network. IEEE Trans Neural Netw Learn Syst 30(11):3484–3495
Article Google Scholar
Roy R, Chakraborti T, Chowdhury AS (2019) A deep learning-shape driven level set synergism for pulmonary nodule segmentation. Pattern Recognit Lett 123:31–38
Article Google Scholar
Shakibapour E, Cunha A, Aresta G, Mendonça AM, Campilho A (2019) An unsupervised metaheuristic search approach for segmentation and volume measurement of pulmonary nodules in lung ct scans. Exp Syst Appl 119:415–428
Article Google Scholar
Zhu W, Liu C, Fan W, Xie X (2018) Deeplung: Deep 3d dual path nets for automated pulmonary nodule detection and classification. In: 2018 IEEE Winter conference on applications of computer vision (WACV), pp 673–681. IEEE
Wei Y, Liang X, Chen Y, Shen X, Cheng M-M, Feng J, Zhao Y, Yan S (2016) Stc: A simple to complex framework for weakly-supervised semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39(11):2314–2320
Article Google Scholar
Li M, Hsu W, Xie X, Cong J, Gao W (2020) Sacnn: Self-attention convolutional neural network for low-dose ct denoising with self-supervised perceptual loss network. IEEE Trans Med Imaging 39(7):2289–2301
Article Google Scholar
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Christ PF, Ettlinger F, Grün F, Elshaera MEA, Lipkova J, Schlecht S, Ahmaddy F, Tatavarty S, Bickel M, Bilic P et al (2017) Automatic liver and tumor segmentation of ct and mri volumes using cascaded fully convolutional neural networks. arXiv:1702.05970
Bi L, Feng D, Kim J (2018) Dual-path adversarial learning for fully convolutional network (fcn)-based medical image segmentation. Vis Comput 34(6):1043–1052
Article Google Scholar
Wu Y, Lin L (2020) Automatic lung segmentation in ct images using dilated convolution based weighted fully convolutional network. In: Journal of Physics: Conference Series, vol 1646, pp 012032. IOP Publishing
Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2017) Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848
Article Google Scholar
Men K, Dai J, Li Y (2017) Automatic segmentation of the clinical target volume and organs at risk in the planning ct for rectal cancer using deep dilated convolutional neural networks. Med Phys 44(12):6377–6389
Article Google Scholar
Xia H, Sun W, Song S, Mou X (2020) Md-net: multi-scale dilated convolution network for ct images segmentation. Neural Process Lett 51(3):2915–2927
Article Google Scholar
Liu Z, Liu X, Xiao B, Wang S, Miao Z, Sun Y, Zhang F (2020) Segmentation of organs-at-risk in cervical cancer ct images with a convolutional neural network. Phys Med 69:184–191
Article Google Scholar
Zhao X, Zhang P, Song F, Fan G, Sun Y, Wang Y, Tian Z, Zhang L, Zhang G (2021) D2a u-net: Automatic segmentation of covid-19 ct slices based on dual attention and hybrid dilated convolution. Comput Biol Med 104526
Zhou Z, Siddiquee Md MR, Tajbakhsh N, Liang J (2018) Unet++: A nested u-net architecture for medical image segmentation. In: Deep learning in medical image analysis and multimodal learning for clinical decision support, pp 3–11. Springer
Li X, Chen H, Qi X, Dou Q, Chi-Wing F, Heng P-A (2018) H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes. IEEE Trans Med Imaging 37(12):2663–2674
Article Google Scholar
He K, Lian C, Zhang B, Zhang X, Cao X, Nie D, Gao Y, Zhang J, Shen D (2021) Hf-unet: learning hierarchically inter-task relevance in multi-task u-net for accurate prostate segmentation in ct images. IEEE Trans Med Imaging 40(8):2118–2128
Article Google Scholar
Khan RA, Luo Y, Wu F-X (2022) Rms-unet: Residual multi-scale unet for liver and lesion segmentation. Artif Intell Med 124:102231
Article Google Scholar
Yin S, Deng H, Zelin X, Zhu Q, Cheng J (2022) Sd-unet: A novel segmentation framework for ct images of lung infections. Electronics 11(1):130
Article Google Scholar
Mehta S, Rastegari M, Caspi A, Shapiro L, Hajishirzi H (2018) Espnet: Efficient spatial pyramid of dilated convolutions for semantic segmentation. In: Proceedings of the european conference on computer vision (ECCV), pp 552–568
Chen Y, Wang K, Liao X, Qian Y, Wang Q, Yuan Z, Heng P-A (2019) Channel-unet: a spatial channel-wise convolutional neural network for liver and tumors segmentation. Front Genet 10:1110
Article Google Scholar
Kushnure DT, Talbar SN (2021) Ms-unet: A multi-scale unet with feature recalibration approach for automatic liver and tumor segmentation in ct images. Comput Med Imaging Graph 89:101885
Article Google Scholar
Xie X, Pan X, Shao F, Zhang W, An J (2022) Mci-net: Multi-scale context integrated network for liver ct image segmentation. Comput Electric Eng 101:108085
Article Google Scholar
Xia H, Ma M, Li H, Song S (2022) Mc-net: multi-scale context-attention network for medical ct image segmentation. Appl Intell 52(2):1508–1519
Article Google Scholar
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Srinivas A, Lin T-Y, Parmar N, Shlens J, Abbeel P, Vaswani A (2021) Bottleneck transformers for visual recognition. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 16519–16529
Sinha A, Dolz J (2020) Multi-scale self-guided attention for medical image segmentation. IEEE J Biomed Health Inform 25(1):121–130
Article Google Scholar
Zhang C, Jingben L, Hua Q, Li C, Wang P (2022) Saa-net: U-shaped network with scale-axis-attention for liver tumor segmentation. Biomed Signal Process Control 73:103460
Article Google Scholar
Tang H, Liu X, Han K, Xie X, Chen X, Qian H, Liu Y, Sun S, Bai N (2021) Spatial context-aware self-attention model for multi-organ segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 939–949
Chen L-C, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Chen L-C, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Wang C-Y, Liao H-YM, Wu Y-H, Chen P-Y, Hsieh J-W, Yeh I-H (2020) Cspnet: A new backbone that can enhance learning capability of cnn. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp 390–391
Cao Y, Xu J, Lin S, Wei F, Hu H (2019) Gcnet: Non-local networks meet squeeze-excitation networks and beyond. In: Proceedings of the IEEE/CVF international conference on computer vision workshops
Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q (2020) Eca-net: efficient channel attention for deep convolutional neural networks, 2020 IEEE. In: CVF conference on computer vision and pattern recognition (CVPR). IEEE
Maqsood M, Yasmin S, Mehmood I, Bukhari M, Kim M (2021) An efficient da-net architecture for lung nodule segmentation. Mathematics 9(13):1457
Article Google Scholar
Shuo Wang M, Zhou ZL, Liu Z, Dongsheng G, Zang Y, Dong D, Gevaert O, Tian J (2017) Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation. Med Image Anal 40:172–183
Article Google Scholar
Jiang J, Hu Y-C, Liu C-J, Halpenny D, Hellmann MD, Deasy JO, Mageras G, Veeraraghavan H (2018) Multiple resolution residually connected feature streams for automatic lung tumor segmentation from ct images. IEEE Tran Med Imaging 38(1):134–144
Article Google Scholar
Huang X, Shan J, Vaidya V (2017) Lung nodule detection in ct using 3d convolutional neural networks. In: 2017 IEEE 14th international symposium on biomedical imaging (ISBI 2017), pp 379–383. IEEE
Zhao X, Sun W, Qian W, Qi S, Sun J, Zhang B, Yang Z (2019) Fine-grained lung nodule segmentation with pyramid deconvolutional neural network. In Medical Imaging 2019: Computer-Aided Diagnosis, vol 10950, pp 956–961. SPIE
Hancock MC, Magnan JF (2019) Lung nodule segmentation via level set machine learning. arXiv:1910.03191
Huang X, Sun W, Tseng T-LB, Li C, Qian W (2019) Fast and fully-automated detection and segmentation of pulmonary nodules in thoracic ct scans using deep convolutional neural networks. Comput Med Imaging Graph 74:25–36
Article Google Scholar
Wang J, Lv P, Wang H, Shi C (2021) Sar-u-net: squeeze-and-excitation block and atrous spatial pyramid pooling based residual u-net for automatic liver segmentation in computed tomography. Comput Methods Programs Biomed 208:106268
Article Google Scholar
Oktay O, Schlemper J, Folgoc LL, Lee M, Heinrich M, Misawa K, Mori K, McDonagh S, Hammerla NY, Kainz B et al (2018) Attention u-net: Learning where to look for the pancreas. arXiv:1804.03999
Changling L, Xiangfen S, Hang Z, Li F, Tao H, Yuchen Z, Jun J, Jianan W, Jianping X, Yong S (2021) An 8-layer residual u-net with deep supervision for segmentation of the left ventricle in cardiac ct angiography. Comput Methods Programs Biomed 200:105876

Download references

Acknowledgements

This work has been supported in part by the National Natural Science Foundation of China under Grant 62102331, 62176125 and 61772272, which also has been supported in part by Natural Science Foundation of Sichuan Province 2022NSFSC0839 and Southwest University of Science and Technology Doctoral Research Fund Project 22zx7110.

Author information

Authors and Affiliations

School of Information Engineering, Southwest University of Science and Technology, Mianyang, 621010, China
Xiaoqian Zhang, Lei Pu & Xiao Wang
School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang, 621010, China
Liming Wan
Radiology Department, Mianyang Central Hospital, Mianyang, 621010, China
Ying Zhou

Authors

Xiaoqian Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Pu
View author publications
You can also search for this author in PubMed Google Scholar
Liming Wan
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ying Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Xiaoqian Zhang: Software, Writing-Reviewing and Editing. Lei Pu: Conceptualization, Methodology, Software. Liming Wan: Writing-Original draft preparation, Data curation, Software. Xiao Wang: Software, Editing. Ying Zhou: Data curation.

Corresponding author

Correspondence to Xiaoqian Zhang.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical and informed consent for data used

This article does not contain any studies with human participants or animals performed by the author. We obtain ethical and informed consent from data subjects before collecting, using, or disclosing their personal data.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhang, X., Pu, L., Wan, L. et al. DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation. Appl Intell 54, 4490–4506 (2024). https://doi.org/10.1007/s10489-024-05372-7

Download citation

Accepted: 07 March 2024
Published: 27 March 2024
Issue Date: March 2024
DOI: https://doi.org/10.1007/s10489-024-05372-7

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Hybrid Multi-atrous and Multi-scale Network for Liver Lesion Detection

Multi-scale feature pyramid fusion network for medical image segmentation

CT-UFormer: an improved hybrid decoder for image segmentation

Data availability and access

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Conflict of interest

Ethical and informed consent for data used

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Hybrid Multi-atrous and Multi-scale Network for Liver Lesion Detection

Multi-scale feature pyramid fusion network for medical image segmentation

CT-UFormer: an improved hybrid decoder for image segmentation

Explore related subjects

Data availability and access

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Conflict of interest

Ethical and informed consent for data used

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.