MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation


Muna Khalaf
Ban N. Dhannoon


Semantic segmentation is an active research topic in medical image analysis because it delineates objects of interest in medical images at the pixel level. In recent years, deep learning approaches have performed more reliably than traditional approaches in medical image segmentation. U-Net is one of the most successful end-to-end convolutional neural networks (CNNs) proposed for medical image segmentation. This paper proposes a multiscale residual dilated convolutional neural network (MSRD-UNet) based on U-Net. MSRD-UNet replaces the traditional convolution block with a novel, deeper block that fuses multi-layer features using dilated and residual convolutions. In addition, the squeeze-and-excitation (SE) attention mechanism and the skip connections are redesigned to give a more reliable fusion of features. MSRD-UNet aggregates contextual information without increasing the number of parameters or the required floating-point operations (FLOPs). The proposed model was evaluated on three datasets covering different imaging modalities: polyp, skin lesion, and nuclei segmentation. The results show that MSRD-UNet outperforms several state-of-the-art U-Net-based methods.
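
The claim that dilated convolutions add context without adding parameters can be checked with a little arithmetic. The sketch below is only an illustration, not the authors' block: the function names are hypothetical, and the dilation rates (1, 2, 4) and the channel width (64) are assumed for the example rather than taken from the paper. It compares a stack of three plain 3x3 convolutions with a stack of three dilated 3x3 convolutions at stride 1.

```python
def conv_params(in_ch, out_ch, kernel=3, bias=True):
    """Learnable parameters in one 2D convolution.

    The dilation rate does not appear here: it changes where the
    kernel samples the input, not how many weights the kernel has.
    """
    return kernel * kernel * in_ch * out_ch + (out_ch if bias else 0)


def effective_kernel(kernel, dilation):
    """Spatial extent (along one axis) covered by a dilated kernel."""
    return kernel + (kernel - 1) * (dilation - 1)


def receptive_field(dilations, kernel=3):
    """Receptive field (along one axis) of stacked stride-1 convolutions."""
    rf = 1
    for d in dilations:
        rf += (kernel - 1) * d
    return rf


# Assumed, illustrative settings (not from the paper).
channels = 64
plain_rates = [1, 1, 1]
dilated_rates = [1, 2, 4]

plain_rf = receptive_field(plain_rates)
dilated_rf = receptive_field(dilated_rates)

# Both stacks use three 3x3 convolutions, so the parameter count is identical.
plain_params = sum(conv_params(channels, channels) for _ in plain_rates)
dilated_params = sum(conv_params(channels, channels) for _ in dilated_rates)

print("plain   3x3 stack: receptive field", plain_rf, "params", plain_params)
print("dilated 3x3 stack: receptive field", dilated_rf, "params", dilated_params)
```

Under these assumptions both stacks hold the same number of weights, while the dilated stack sees a 15-pixel window per axis instead of 7. Floating-point operations per output pixel also match, since each kernel still performs the same number of multiply-adds.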




How to Cite
Khalaf M, Dhannoon BN. MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation. Baghdad Sci.J [Internet]. 2022 Dec. 5 [cited 2023 Jan. 28];19(6(Suppl.)):1603. Available from:

