MI2A: A Multimodal Information Interaction Architecture for Automated Diagnosis of Lung Nodules Using PET/CT Imaging

Kai Li; Tongtong Li; Lei Zhang; Junfeng Mao; Xiangjun Shi; Zhijun Yao; Lei Fang; Bin Hu

doi:10.1109/jsen.2025.3580100

Abstract

Lung cancer is one of the most common malignancies globally, and malignant nodules are an early indicator of the disease, so accurate early diagnosis of lung nodules is imperative. Positron Emission Tomography-Computed Tomography (PET/CT) is a non-invasive imaging technique that provides both anatomical and metabolic information and plays a crucial role in cancer diagnosis. Existing deep learning-based multimodal fusion strategies often rely on simple concatenation of features from the two modalities, overlooking the intricate interactions between them. In this study, we propose a multimodal information interaction framework, named MI2A, for the automated diagnosis of lung nodules using PET/CT imaging. First, the lung parenchymal regions were cropped as regions of interest using a pre-trained U-Net model. Second, higher-order multimodal features were extracted from the PET/CT scans by a custom-designed PET-CT Imaging Encoder (PCIE) module and integrated by a Cross-Attention Multimodal Encoder (CAME) module. Predictions were then generated using multi-path pooling layers and a multi-layer perceptron (MLP) layer. Furthermore, an alignment loss function was designed to minimize the discrepancy between the modality features during training. Finally, the proposed model was evaluated on a real clinical dataset, achieving accuracy, precision, recall, specificity, and F1 scores of 0.9179, 0.8972, 0.8937, 0.9335, and 0.8954, respectively. In addition, the findings revealed that certain benign lesions, particularly those related to inflammatory or infectious conditions, displayed high metabolic activity, which was the main factor limiting the model’s performance. This insight provides a promising direction for future research.
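
For readers less familiar with the five reported metrics, the sketch below shows how they are conventionally derived from a binary confusion matrix, assuming the malignant class is treated as positive. The function names and the example counts are hypothetical and are not taken from the paper; only the reported precision and recall in the last lines come from the abstract.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard binary classification metrics from confusion-matrix counts.

    Assumes the positive class is "malignant" and that every denominator
    is nonzero (no guard against empty classes in this sketch).
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # also called sensitivity
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "precision": precision,
        "recall": recall,
        "specificity": tn / (tn + fp),
        "f1": 2 * precision * recall / (precision + recall),
    }


def f1_from(precision, recall):
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only (not the paper's data).
m = binary_metrics(tp=90, fp=10, tn=140, fn=10)

# Consistency check on the abstract's reported values: the F1 implied by
# the reported precision (0.8972) and recall (0.8937).
implied_f1 = f1_from(0.8972, 0.8937)
```

Under these definitions, the F1 implied by the reported precision and recall agrees with the reported F1 of 0.8954 to within rounding. How the paper aggregates metrics (for example, across folds) is not stated in the abstract, so this check is only indicative.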