Learning from pseudo-lesion: a self-supervised framework for COVID-19 diagnosis

Zhongliang Li; Xuechen Li; Zhihao Jin; Linlin Shen

doi:10.1007/s00521-023-08259-9

Research output: Journal Publication › Article › peer-review

Abstract

Coronavirus disease 2019 (COVID-19) has spread rapidly all over the world since its first report in December 2019, and thoracic computed tomography (CT) has become one of the main tools for its diagnosis. In recent years, deep learning-based approaches have shown impressive performance in myriad image recognition tasks. However, they usually require a large amount of annotated data for training. Inspired by ground-glass opacity, a common finding in COVID-19 patients’ CT scans, we proposed in this paper a novel self-supervised pretraining method based on pseudo-lesion generation and restoration for COVID-19 diagnosis. We used Perlin noise, a gradient-noise-based mathematical model, to generate lesion-like patterns, which were then randomly pasted onto the lung regions of normal CT images to generate pseudo-COVID-19 images. The pairs of normal and pseudo-COVID-19 images were then used to train an encoder–decoder architecture-based U-Net for image restoration, which does not require any labeled data. The pretrained encoder was then fine-tuned using labeled data for the COVID-19 diagnosis task. Two public COVID-19 diagnosis datasets made up of CT images were employed for evaluation. Comprehensive experimental results demonstrated that the proposed self-supervised learning approach could extract better feature representation for COVID-19 diagnosis, and the accuracy of the proposed method outperformed the supervised model pretrained on large-scale images by 6.57% and 3.03% on the SARS-CoV-2 dataset and the Jinan COVID-19 dataset, respectively.
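The pseudo-lesion generation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `perlin_2d` and `add_pseudo_lesion`, the parameters `cell`, `threshold`, and `strength`, and the assumption that a binary lung mask is already available are all hypothetical choices made for illustration. The sketch uses only the Python standard library and represents images as nested lists with intensities in [0, 1].

```python
import math
import random


def perlin_2d(width, height, cell, rng):
    """Return a height x width grid of 2D Perlin (gradient) noise."""
    # One random unit gradient vector per lattice corner.
    grid_w = width // cell + 2
    grid_h = height // cell + 2
    grads = []
    for _ in range(grid_h):
        row = []
        for _ in range(grid_w):
            angle = rng.uniform(0.0, 2.0 * math.pi)
            row.append((math.cos(angle), math.sin(angle)))
        grads.append(row)

    def fade(t):
        # Perlin's quintic smoothing curve 6t^5 - 15t^4 + 10t^3.
        return t * t * t * (t * (6.0 * t - 15.0) + 10.0)

    def lerp(a, b, t):
        return a + t * (b - a)

    def corner(ix, iy, fx, fy):
        # Dot product of the corner gradient with the offset to the pixel.
        g = grads[iy][ix]
        return g[0] * (fx - ix) + g[1] * (fy - iy)

    noise = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            fx, fy = x / cell, y / cell
            x0, y0 = int(fx), int(fy)
            u, v = fade(fx - x0), fade(fy - y0)
            top = lerp(corner(x0, y0, fx, fy), corner(x0 + 1, y0, fx, fy), u)
            bottom = lerp(corner(x0, y0 + 1, fx, fy),
                          corner(x0 + 1, y0 + 1, fx, fy), u)
            noise[y][x] = lerp(top, bottom, v)
    return noise


def add_pseudo_lesion(image, lung_mask, cell=8, threshold=0.1,
                      strength=0.5, seed=0):
    """Blend thresholded Perlin noise into the masked region of an image.

    All parameter values are illustrative assumptions, not values
    taken from the paper.
    """
    rng = random.Random(seed)
    height, width = len(image), len(image[0])
    noise = perlin_2d(width, height, cell, rng)
    out = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            if lung_mask[y][x] and noise[y][x] > threshold:
                # Brighten the pixel to mimic a hazy opacity, clipped to 1.0.
                out[y][x] = min(1.0, image[y][x] + strength * noise[y][x])
    return out


# Toy example: a flat 32 x 32 "scan" with a square stand-in for the lung mask.
normal = [[0.2] * 32 for _ in range(32)]
mask = [[8 <= x < 24 and 8 <= y < 24 for x in range(32)] for y in range(32)]
pseudo = add_pseudo_lesion(normal, mask)
```

Under these assumptions, each `(pseudo, normal)` pair would serve as an input and target for the restoration network, so no diagnostic labels are needed at the pretraining stage.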

Original language	English
Pages (from-to)	10717-10731
Number of pages	15
Journal	Neural Computing and Applications
Volume	35
Issue number	15
DOIs	https://doi.org/10.1007/s00521-023-08259-9
Publication status	Published - May 2023
Externally published	Yes

Keywords

COVID-19 diagnosis
Lesion modeling
Self-supervised learning

ASJC Scopus subject areas

Software
Artificial Intelligence

Access to Document

10.1007/s00521-023-08259-9

Cite this

@article{bb5aced556e14904a7a5817d9ecc3577,

title = "Learning from pseudo-lesion: a self-supervised framework for COVID-19 diagnosis",

abstract = "Coronavirus disease 2019 (COVID-19) has spread rapidly all over the world since its first report in December 2019, and thoracic computed tomography (CT) has become one of the main tools for its diagnosis. In recent years, deep learning-based approaches have shown impressive performance in myriad image recognition tasks. However, they usually require a large amount of annotated data for training. Inspired by ground-glass opacity, a common finding in COVID-19 patients{\textquoteright} CT scans, we proposed in this paper a novel self-supervised pretraining method based on pseudo-lesion generation and restoration for COVID-19 diagnosis. We used Perlin noise, a gradient-noise-based mathematical model, to generate lesion-like patterns, which were then randomly pasted onto the lung regions of normal CT images to generate pseudo-COVID-19 images. The pairs of normal and pseudo-COVID-19 images were then used to train an encoder–decoder architecture-based U-Net for image restoration, which does not require any labeled data. The pretrained encoder was then fine-tuned using labeled data for the COVID-19 diagnosis task. Two public COVID-19 diagnosis datasets made up of CT images were employed for evaluation. Comprehensive experimental results demonstrated that the proposed self-supervised learning approach could extract better feature representation for COVID-19 diagnosis, and the accuracy of the proposed method outperformed the supervised model pretrained on large-scale images by 6.57% and 3.03% on the SARS-CoV-2 dataset and the Jinan COVID-19 dataset, respectively.",

keywords = "COVID-19 diagnosis, Lesion modeling, Self-supervised learning",

author = "Zhongliang Li and Xuechen Li and Zhihao Jin and Linlin Shen",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.",

year = "2023",

month = may,

doi = "10.1007/s00521-023-08259-9",

language = "English",

volume = "35",

pages = "10717--10731",

journal = "Neural Computing and Applications",

issn = "0941-0643",

publisher = "Springer London",

number = "15",

}

TY - JOUR

T1 - Learning from pseudo-lesion: a self-supervised framework for COVID-19 diagnosis

AU - Li, Zhongliang

AU - Li, Xuechen

AU - Jin, Zhihao

AU - Shen, Linlin

PY - 2023/5

Y1 - 2023/5

N2 - Coronavirus disease 2019 (COVID-19) has spread rapidly all over the world since its first report in December 2019, and thoracic computed tomography (CT) has become one of the main tools for its diagnosis. In recent years, deep learning-based approaches have shown impressive performance in myriad image recognition tasks. However, they usually require a large amount of annotated data for training. Inspired by ground-glass opacity, a common finding in COVID-19 patients’ CT scans, we proposed in this paper a novel self-supervised pretraining method based on pseudo-lesion generation and restoration for COVID-19 diagnosis. We used Perlin noise, a gradient-noise-based mathematical model, to generate lesion-like patterns, which were then randomly pasted onto the lung regions of normal CT images to generate pseudo-COVID-19 images. The pairs of normal and pseudo-COVID-19 images were then used to train an encoder–decoder architecture-based U-Net for image restoration, which does not require any labeled data. The pretrained encoder was then fine-tuned using labeled data for the COVID-19 diagnosis task. Two public COVID-19 diagnosis datasets made up of CT images were employed for evaluation. Comprehensive experimental results demonstrated that the proposed self-supervised learning approach could extract better feature representation for COVID-19 diagnosis, and the accuracy of the proposed method outperformed the supervised model pretrained on large-scale images by 6.57% and 3.03% on the SARS-CoV-2 dataset and the Jinan COVID-19 dataset, respectively.

AB - Coronavirus disease 2019 (COVID-19) has spread rapidly all over the world since its first report in December 2019, and thoracic computed tomography (CT) has become one of the main tools for its diagnosis. In recent years, deep learning-based approaches have shown impressive performance in myriad image recognition tasks. However, they usually require a large amount of annotated data for training. Inspired by ground-glass opacity, a common finding in COVID-19 patients’ CT scans, we proposed in this paper a novel self-supervised pretraining method based on pseudo-lesion generation and restoration for COVID-19 diagnosis. We used Perlin noise, a gradient-noise-based mathematical model, to generate lesion-like patterns, which were then randomly pasted onto the lung regions of normal CT images to generate pseudo-COVID-19 images. The pairs of normal and pseudo-COVID-19 images were then used to train an encoder–decoder architecture-based U-Net for image restoration, which does not require any labeled data. The pretrained encoder was then fine-tuned using labeled data for the COVID-19 diagnosis task. Two public COVID-19 diagnosis datasets made up of CT images were employed for evaluation. Comprehensive experimental results demonstrated that the proposed self-supervised learning approach could extract better feature representation for COVID-19 diagnosis, and the accuracy of the proposed method outperformed the supervised model pretrained on large-scale images by 6.57% and 3.03% on the SARS-CoV-2 dataset and the Jinan COVID-19 dataset, respectively.

KW - COVID-19 diagnosis

KW - Lesion modeling

KW - Self-supervised learning

UR - http://www.scopus.com/inward/record.url?scp=85150624470&partnerID=8YFLogxK

U2 - 10.1007/s00521-023-08259-9

DO - 10.1007/s00521-023-08259-9

M3 - Article

AN - SCOPUS:85150624470

SN - 0941-0643

VL - 35

SP - 10717

EP - 10731

JO - Neural Computing and Applications

JF - Neural Computing and Applications

IS - 15

ER -
