LungNet: A Transformer-Based Deep Learning Model for Early Lung Cancer Detection from CT Images

Authors

  • Arun Kumar Assistant Professor, Department of Computer Science & Engineering G.L Bajaj Institute of Technology and Management , Gr. Noida
  • Vijit Srivastava Assistant Professor, Electronics & Communication, United College of Engineering and Research Prayagraj
  • Manisha Mittal Associate Professor (ECE), Guru Tegh Bahadur Institute of Technology, New Delhi, India
  • Manjeet Associate Professor, Electrical and Electronics Engineering Dept., KVMT Khera Siwani. Bhiwani, India
  • Sandeep Associate professor, Department of computer science and engineering, Navkis college of engineering
  • M. Indrapriya Assistant Professor, Department of Banking and Insurance, KPR College of Arts Science and Research, Coimbatore
  • Uttam U. Deshpande Department of Electronics and Communication Engineering, KLS Gogte Institute of Technology, Karnataka, India

Keywords:

Lung cancer detection, Vision Transformer, CT imaging, Deep learning, LungNet, Medical image analysis, Early diagnosis, Hybrid CNN-Transformer, Computer-aided diagnosis

Abstract

Early detection of lung cancer has a greater opportunity in terms of survival rate; however, the current diagnostic systems are already dealing with some difficulties in achieving high accuracy, particularly in early-stage nodules with fine-grained differences. In this paper, we present LungNet, a new deep learning model that combines Vision Transformers (ViT) with a convolutional neural network (CNN) backbone for early detection of lung cancer in computed tomography (CT) scans. Different from the conventional CNN-based networks, the global attention mechanism of transformers in LungNet helps successfully model long-range correlations and context across anatomical regions of the input images, which is crucial to accurately localize the malignant features even in complex backgrounds. We train and evaluate the proposed model on the public dataset LIDC-IDRI, with performance superior to the state-of-the-art, the accuracy, sensitivity, and F1-score are 94.6%, 96.1%, 0.942, respectively. Attention visualization indicates that LungNet pays attention to meaningful regions, thus making the model interpretable. This is a hybrid architecture that succeeds to incorporate the local detail extraction capabilities of CNNs with the global reasoning of transformers, leading to an effective and scalable intermediate-level solution for computer-aided lung cancer screening. Our findings suggest the applicability of LungNet in the radiologist’s workflow with the help of LungNet for more robust, interpretable, and early lung cancer diagnosis.

Downloads

Published

2025-07-08

Issue

Section

Research Articles