TY - JOUR
T1 - Dual-Attention based Enhanced Unified Net for Precise GTV Segmentation of Nasopharyngeal Carcinoma in 3D MR Images
AU - Khan, Hassan Ali
AU - Xueqing, Gong
AU - Amin, Muhammad Shoib
AU - Siddique, Zeeshan Bin
AU - Naeem, Muhammad Ahtsam
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2025
Y1 - 2025
N2 - Accurate gross tumor volume (GTV) segmentation is essential for effective radiotherapy in nasopharyngeal carcinoma (NPC). However, challenges arise due to the nasopharyngeal region’s complex anatomy and the annotated data scarcity. Our study presents a dual-attention-based enhanced unified network (DAEU-Net) designed for precise NPC GTV segmentation utilizing 3D T1, T2, and T1C-weighted MR images. Our approach involves splitting large-scale MR data into multiple patches and then training every patch independently. This approach effectively captures localized and detailed information without downscaling the image resolution. The DAEU-Net integrates channel-attention and pixel-attention modules within the encoder section, eliminating background noise and reducing information loss by enhancing the network’s focus on detailed features. The decoder section incorporates bottleneck residual blocks to enhance the computing efficiency and robustness of the network. The proposed methodology surpasses the state-of-the-art models with a respective average symmetric surface distance (ASSD) of 0.920±0.386 mm, 0.987±0.421 mm, and 1.043±0.457 mm and a dice similarity coefficient (DSC) of 0.896, 0.871, and 0.851, respectively. Multi-viewed animated MR images in three orthogonal dimensions (axial, sagittal, and coronal) with predicted NPC tumors and real GTV masks were shown to assist in comprehending the tumor’s precise location. Our approach can significantly improves NPC tumor delineation, aids in automated tumor lesion segmentation, and reduces the annotation workload for oncologists.
AB - Accurate gross tumor volume (GTV) segmentation is essential for effective radiotherapy in nasopharyngeal carcinoma (NPC). However, challenges arise due to the nasopharyngeal region’s complex anatomy and the annotated data scarcity. Our study presents a dual-attention-based enhanced unified network (DAEU-Net) designed for precise NPC GTV segmentation utilizing 3D T1, T2, and T1C-weighted MR images. Our approach involves splitting large-scale MR data into multiple patches and then training every patch independently. This approach effectively captures localized and detailed information without downscaling the image resolution. The DAEU-Net integrates channel-attention and pixel-attention modules within the encoder section, eliminating background noise and reducing information loss by enhancing the network’s focus on detailed features. The decoder section incorporates bottleneck residual blocks to enhance the computing efficiency and robustness of the network. The proposed methodology surpasses the state-of-the-art models with a respective average symmetric surface distance (ASSD) of 0.920±0.386 mm, 0.987±0.421 mm, and 1.043±0.457 mm and a dice similarity coefficient (DSC) of 0.896, 0.871, and 0.851, respectively. Multi-viewed animated MR images in three orthogonal dimensions (axial, sagittal, and coronal) with predicted NPC tumors and real GTV masks were shown to assist in comprehending the tumor’s precise location. Our approach can significantly improves NPC tumor delineation, aids in automated tumor lesion segmentation, and reduces the annotation workload for oncologists.
KW - deep learning
KW - Medical Imaging
KW - Neural Networks
KW - NPC
KW - Segmentation
KW - U-Net
UR - https://www.scopus.com/pages/publications/105008675875
U2 - 10.1109/ACCESS.2025.3580600
DO - 10.1109/ACCESS.2025.3580600
M3 - 文章
AN - SCOPUS:105008675875
SN - 2169-3536
JO - IEEE Access
JF - IEEE Access
ER -