TransDD: A transformer-based dual-path decoder for improving the performance of thoracic diseases classification using chest X-ray

Image credit: Elsevier

Abstract

Manually and accurately detecting thoracic diseases from CXR images is a time-consuming task that requires experienced radiologists. Therefore, automated thoracic diseases classification has great significance. However, most existing methods solely leverage the feature maps extracted from CXR images to classify thoracic diseases, without effectively connecting the correlation between the local discriminative lesion features and their corresponding labels. To address this issue, we innovatively introduce a learnable label embedding as queries to detect and match class-related features from the feature maps, and then processed by a novel Transformer-based dual-path decoder (TransDD) to facilitate interaction. The proposed TransDD is comprised of three key components: spatial reduction attention (SRA), dual-path attention (DPA), and feature enhancement module (FEM). SRA is employed in simplifying the complexity of self-attention, while DPA is specifically designed to connect the explicit correlation between the features and labels. Moreover, FEM is used to boost the expressiveness of local features. Subsequently, the classification attention block is utilized to balance two classification scores based on the feature output and label output, respectively. The proposed TransDD-PVT attained SOTA performance on the ChestX-ray14 dataset, achieving a mean area under the receiver operating characteristic (AUC) of 83.1% across all 14 classes. Also, our method achieves 94.31% accuracy and 93.31% sensitivity on three-class classifications. Extensive experiments conducted on several datasets demonstrate the powerful ability of our TransDD to improve the performance of thoracic diseases classification. It can serve as a plug-and-play structure to improve the classification performance of both CNNs and recent Transformer-based backbones.

Publication
Biomedical Signal Processing and Control
Xiaoben Jiang 蒋晓奔
Xiaoben Jiang 蒋晓奔
PhD. One apple a day keep the doctor away.

A doctor student of this laboratory, research interests include Medical image processing, AIGC, and Image denoising.

Yu Zhu 朱煜
Yu Zhu 朱煜
Professor. Experts in artificial intelligence and computer vision. Lab leader.

Leader of this laboratory, research interests include Artificial Intelligence, Computer Vision, Industrial controls, Digital Image and Video Processing, Machine learning, Deep Learning and Applications.

Yatong Liu 刘雅童
Yatong Liu 刘雅童
PhD.

A doctoral student of this laboratory, research interests include Medical Image Processing, Deep Learning Algorithm and Multi-source Medical Image Intelligent Analysis.

Gan Cai 蔡淦
Gan Cai 蔡淦
Master.

A Master student of this laboratory, research interests include Deep Learning and Medical Image Processing.