TY - GEN
T1 - A Multi-Scale Self-Attention Network for Diabetic Retinopathy Retrieval
AU - Zeng, Ming
AU - Fang, Jiansheng
AU - Miao, Hanpei
AU - Zhang, Tianyang
AU - Liu, Jiang
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/8/13
Y1 - 2021/8/13
N2 - Diabetic retinopathy (DR), a complication due to diabetes, is a common cause of progressive damage to the retina. The mass screening of populations for DR is time-consuming. Therefore, computerized diagnosis is of great significance in the clinical practice, which providing evidence to assist clinicians in decision making. Specifically, hemorrhages, microaneurysms, hard exudates, soft exudates, and other lesions are verified to be closely associated with DR. These lesions, however, are scattered in different positions and sizes in fundus images, the internal relation of which are hard to be reserved in the ultimate features due to a large number of convolution layers that reduce the detail characteristics. In this paper, we present a deep-learning network with a multi-scale self-attention module to aggregate the global context to learned features for DR image retrieval. The multi-scale fusion enhances, in terms of scale, the efficacious latent relation of different positions in features explored by the self-attention. For the experiment, the proposed network is validated on the Kaggle DR dataset, and the result shows that it achieves state-of-the-art performance.
AB - Diabetic retinopathy (DR), a complication due to diabetes, is a common cause of progressive damage to the retina. The mass screening of populations for DR is time-consuming. Therefore, computerized diagnosis is of great significance in the clinical practice, which providing evidence to assist clinicians in decision making. Specifically, hemorrhages, microaneurysms, hard exudates, soft exudates, and other lesions are verified to be closely associated with DR. These lesions, however, are scattered in different positions and sizes in fundus images, the internal relation of which are hard to be reserved in the ultimate features due to a large number of convolution layers that reduce the detail characteristics. In this paper, we present a deep-learning network with a multi-scale self-attention module to aggregate the global context to learned features for DR image retrieval. The multi-scale fusion enhances, in terms of scale, the efficacious latent relation of different positions in features explored by the self-attention. For the experiment, the proposed network is validated on the Kaggle DR dataset, and the result shows that it achieves state-of-the-art performance.
KW - Deep learning
KW - Diabetic retinopathy
KW - Image retrieval
KW - Self-attention
UR - http://www.scopus.com/inward/record.url?scp=85120532249&partnerID=8YFLogxK
U2 - 10.1145/3484274.3484290
DO - 10.1145/3484274.3484290
M3 - Conference contribution
AN - SCOPUS:85120532249
T3 - ACM International Conference Proceeding Series
SP - 101
EP - 106
BT - ICCCV 2021 - Proceedings of the 4th International Conference on Control and Computer Vision
PB - Association for Computing Machinery
T2 - 4th International Conference on Control and Computer Vision, ICCCV 2021
Y2 - 13 August 2021 through 15 August 2021
ER -