Kevin Freire

Logo

I am a data scientist with over 2 years of experience in machine learning and data analysis. With a Masters degree in Computer Engineering from Toronto Metropolitan University.

MULTI-SCALE DILATION WITH RESIDUAL FUSED ATTENTION NETWORK FOR LOW DOSE CT NOISE ARTIFACT REDUCTIONS


Introduction

Computed Tomography (CT) scans produce more than half the radiation exposure from medical use which result in problems for long term use of these expensive machines. Some solutions have involved reducing the radiation dose, however that leads to noise artifacts making the low dose CT (LDCT) images unreliable for diagnosis. In this study, a Multi-scale Dilation with Residual Fused Attention (MD-RFA) deep neural network is proposed, more specifically a network with an integration with a Multi-scale feature mapping, spatial- and channel-attention module to enhance the quality of LDCT images. Further, the multi-scale image mapping uses a series of dilated convolution layers, which promotes the model to capture hierarchy features of different scales. The attention modules are combined in a parallel connection and are described as a boosting attention fusion block (BAFB) that are then stacked on top of one another creating a residual connection known as a boosting Module group (BMG).

Objective

Methodology - Proposed Model

Figure 1: Multi-scale Dilation with Residual Fused Attention (MD-RFA) Network Architecture.


Figure 2: (A) Boosting Attention Fusion Block, (B) Spatial Attention Module and (C) Channel Attention Module.

Methodology - Dataset

Table 1: NDCT-LDCT Image dataset Specifications [2]

Implementation

Analysis

Figure 3

Table 2

Table 2: summarizes the average PSNR and SSIM results of the different denoising algorithms for Thoracic, Piglet, Head, Chest and Abdomen datasets.


Figure 3: Sample visual results of highlighted sections (red/blue ROI bounding box) from the Chest and Thoracic dataset. From Top to bottom of ROI samples, LDCT, FAM-DRL, MD-RFA, NDCT images.

Conclusion

This study found that incorporating the multi-scale feature mapping and RESNET50 V2 improved feature extraction, and incorporating a boosting module group with a non-local spatial attention reduced training time and improved PSNR and SSIM by about +0.60dB and +0.0229 respectively.

Future Work

Acknowledgements

The authors would like to thank Dr. Cynthia McCollough, the Mayo Clinic, and the America Association of Physicists in Medicine for making the CT data available for the study.

References

[1] L. Marcos, J. Alirezaie, and P. Babyn “Fused Attention Modules in Dilated ResNet for Low-dose CT Denoising With Perceptual, Dissimilarity and Per-Pixel Loss Functions”, in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine 7 Biology Society (EMBC), 2021, pp.3407-3410
[2] X. Yi and P. Babyn, “Sharpness-aware low-dose CT denoising using conditional generative adversarial network,” Journal of digital imaging, vol. 31, no. 5, pp. 655–669, 2018.
[3] S. Bera and P. K. Biswas, “Noise conscious training of non local neural network powered by self attentive spectral normalized markovian patch gan for low dose CT denoising,” IEEE Transactions on Medical Imaging, vol. 40, no. 12, pp. 3663–3673, 2021.
[4] M. Gholizadeh-Ansari, J. Alirezaie, and P. Babyn, “Deep learning for low-dose CT denoising using perceptual loss and edge detection layer,” Journal of digital imaging, vol. 33, no. 2, pp. 504–515, 2020.
[5] K. Dabov, A. Foi, V. Katkovnik, and K. Egiazarian, “Image denoising by sparse 3-D transform-domain collaborative filtering,” IEEE Transactions on image processing, vol. 16, no. 8, pp. 2080–2095, 2007.
[6] Jin Liu, Zhenyu Xia, Yanqin Kang, and Jun Qiang, “Low dose ct noise artifact reduction based on multi-scale weighted convolutional coding network,” in 2021 7th International Conference on Systems and Informatics (ICSAI), 2021, pp. 1–6.