HierDETR: Hierarchical Multi-Head Attention-based Pulmonary Nodule Detection in Medical Imaging

Authors

  • Zhenhao Tong

DOI:

https://doi.org/10.54097/aktcae20

Keywords:

Pulmonary Nodule Detection, Hierarchical Multi-Head Attention, Early Lung Cancer Diagnosis

Abstract

Lung cancer is one of the leading causes of morbidity and mortality worldwide, and early detection is crucial for improving patient survival rates. Medical imaging, particularly chest CT scans, plays a vital role in early lung cancer screening and pulmonary nodule detection. However, the detection of pulmonary nodules is challenging due to the variability in size, shape, density, and location of nodules, as well as their complex background, which often includes normal or benign structures such as blood vessels and bronchial walls. To address these challenges, this paper presents a novel pulmonary nodule detection framework based on Hierarchical Multi-Head Attention (HMHA) and Query-Key Cache Updating (QKCU) mechanisms. The proposed HierDETR model improves upon existing methods by reducing redundancy in attention heads and effectively utilizing multi-scale and hierarchical context information. Experimental results on the publicly available LUNA16 dataset demonstrate that the HierDETR model outperforms mainstream methods, achieving significant improvements in detection metrics such as F1 Score, Average Precision, and Average Recall. This work provides a promising approach for enhancing the robustness and accuracy of pulmonary nodule detection, with potential applications in clinical practice for early lung cancer diagnosis.

Downloads

Download data is not yet available.

References

[1] Bray, Freddie, et al. "Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries." CA: a cancer journal for clinicians 68.6 (2018): 394-424.

[2] Javed, Rabia, et al. "Deep learning for lungs cancer detection: a review." Artificial Intelligence Review 57.8 (2024): 197.

[3] El-Bana, S., A. Al-Kabbany, and M. Sharkas. "A Two-Stage Framework for Automated Malignant Pulmonary Nodule Detection in CT Scans. Diagnostics. 2020; 10: 131."

[4] Xiao, Zhitao, et al. "Segmentation of lung nodules using improved 3D-UNet neural network." Symmetry 12.11 (2020): 1787.

[5] Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017).

[6] Walter, Joan E., et al. "Occurrence and lung cancer probability of new solid nodules at incidence screening with low-dose CT: analysis of data from the randomised, controlled NELSON trial." The Lancet Oncology 17.7 (2016): 907-916.

[7] Gu, Yu, et al. "Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs." Computers in biology and medicine 103 (2018): 220-231.

[8] Zhou, Shihao, et al. "Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration." arXiv preprint arXiv:2503.20174 (2025).

[9] Lin, Tsung-Yi, et al. "Focal loss for dense object detection." Proceedings of the IEEE international conference on computer vision. 2017.

[10] Redmon, Joseph, et al. "You only look once: Unified, real-time object detection." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.

[11] Carion, Nicolas, et al. "End-to-end object detection with transformers." European conference on computer vision. Cham: Springer International Publishing, 2020.

[12] Ramezani, Hooman, Dionne Aleman, and Daniel Létourneau. "Lung-DETR: Deformable Detection Transformer for Sparse Lung Nodule Anomaly Detection." arXiv preprint arXiv:2409.05200 (2024).

Downloads

Published

21-07-2025

Issue

Section

Articles