TY - GEN
T1 - Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame
AU - Li, Derui
AU - Hu, Yan
AU - Shen, Junyong
AU - Hao, Luoying
AU - Liu, Jiang
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Accurate surgical video semantic segmentation is vital for computer-aided surgery. Semi-supervised algorithms produce pseudo labels to solve the problem of the lack of labels, as it is very difficult to obtain the pixel-level segmentation labels from doctors or researchers. However, most of the algorithms consider the videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. The paper proposes a novel Cross Supervision of Inter-frame (CSI) method using inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to transfer features with class prototypes between continuous frames mutually. Besides, we utilize ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames, we propose a cross pseudo loss and a pixel-wise contrastive loss as the constraints. Extensive experiments are performed on a publicly available cataract surgery dataset, which proves that our CSI method improves the segmentation accuracy after considering the inter-frame information.
AB - Accurate surgical video semantic segmentation is vital for computer-aided surgery. Semi-supervised algorithms produce pseudo labels to solve the problem of the lack of labels, as it is very difficult to obtain the pixel-level segmentation labels from doctors or researchers. However, most of the algorithms consider the videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. The paper proposes a novel Cross Supervision of Inter-frame (CSI) method using inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to transfer features with class prototypes between continuous frames mutually. Besides, we utilize ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames, we propose a cross pseudo loss and a pixel-wise contrastive loss as the constraints. Extensive experiments are performed on a publicly available cataract surgery dataset, which proves that our CSI method improves the segmentation accuracy after considering the inter-frame information.
KW - Cataract surgery
KW - Inter-frame
KW - Segmentation
KW - Semi-supervised
UR - http://www.scopus.com/inward/record.url?scp=85172167317&partnerID=8YFLogxK
U2 - 10.1109/ISBI53787.2023.10230375
DO - 10.1109/ISBI53787.2023.10230375
M3 - Conference contribution
AN - SCOPUS:85172167317
T3 - Proceedings - International Symposium on Biomedical Imaging
BT - 2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023
PB - IEEE Computer Society
T2 - 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023
Y2 - 18 April 2023 through 21 April 2023
ER -