TY - GEN
T1 - Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization
AU - Xie, Jinheng
AU - Luo, Cheng
AU - Zhu, Xiangping
AU - Jin, Ziqi
AU - Lu, Weizeng
AU - Shen, Linlin
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
AB - We present a two-stage learning framework for weakly supervised object localization (WSOL). While most previous efforts rely on high-level feature-based CAMs (Class Activation Maps), this paper proposes to localize objects using low-level feature-based activation maps. In the first stage, an activation map generator produces activation maps from the low-level feature maps of the classifier, so that rich contextual object information is captured in an online manner. In the second stage, an evaluator assesses the activation maps predicted by the generator. Based on this evaluation, we further propose a weighted entropy loss, an attentive erasing strategy, and an area loss that drive the generator to substantially reduce the uncertainty of activations between object and background and to explore less discriminative regions. Building on the low-level object information preserved in the first stage, the second-stage model gradually generates a well-separated, complete, and compact activation map of the object in the image, which can be easily thresholded for accurate localization. Extensive experiments on the CUB-200-2011 and ImageNet-1K datasets show that our framework surpasses previous methods by a large margin, setting a new state of the art for WSOL. Code will be available soon.
UR - http://www.scopus.com/inward/record.url?scp=85121150602&partnerID=8YFLogxK
U2 - 10.1109/ICCV48922.2021.00020
DO - 10.1109/ICCV48922.2021.00020
M3 - Conference contribution
AN - SCOPUS:85121150602
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 132
EP - 141
BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
Y2 - 11 October 2021 through 17 October 2021
ER -
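
Note: The abstract above states that the refined activation map "can be easily thresholded for accurate localization." The Python sketch below is an illustrative example of that standard WSOL post-processing step only, not the authors' released code; the threshold value, the function name activation_map_to_bbox, and the use of OpenCV's connectedComponentsWithStats are assumptions made for demonstration.

import numpy as np
import cv2

def activation_map_to_bbox(act_map, threshold=0.5):
    """Threshold a normalized activation map and return a bounding box.

    act_map: 2D float array (H, W). threshold: fraction of the peak activation
    (0.5 is an assumed example value, not the paper's setting).
    Returns (x, y, w, h) of the largest foreground component, or None.
    """
    # Normalize to [0, 1] so the threshold is relative to the peak activation.
    amin, amax = act_map.min(), act_map.max()
    norm = (act_map - amin) / (amax - amin + 1e-8)

    # Binarize: pixels above the threshold are treated as object.
    mask = (norm >= threshold).astype(np.uint8)

    # Keep the largest connected component and take its tight bounding box.
    num, labels, stats, _ = cv2.connectedComponentsWithStats(mask, connectivity=8)
    if num <= 1:  # only background was found
        return None
    largest = 1 + np.argmax(stats[1:, cv2.CC_STAT_AREA])
    x, y, w, h = stats[largest, :4]
    return int(x), int(y), int(w), int(h)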