Spatial–temporal multi-task learning for salient region detection

Zhe Chen; Ruili Wang; Ming Yu; Hongmin Gao; Qi Li; Huibin Wang

doi:10.1016/j.patrec.2018.10.019

Spatial–temporal multi-task learning for salient region detection

Zhe Chen, Ruili Wang, Ming Yu, Hongmin Gao, Qi Li, Huibin Wang

Research output: Journal Publication › Article › peer-review

3 Citations (Scopus)

Abstract

This paper proposes a novel multi-task learning based salient region detection method by fusing spatial and temporal features. Salient region detection has been widely used in various computer vision tasks, being as a general preprocessor to identify interest objects. Despite the recent successes, existing saliency models still lag behind the performance of human when visually perceives dynamic scenes. Most of the existing models largely rely on various spatial features. However, these spatial feature based methods have several deficiencies: (i) they can hardly adapt to the situation where moving objects are included, and (ii) they cannot model the human vision process in dynamic scenes. Recently, some saliency models introduce temporal features in their detecting process, such as the optical flow and stacking frames. The potential of temporal features for saliency optimization has been demonstrated. However, since temporal features in these models are merely used as a compensation to static features, the advantages of temporal features have not yet been fully explored. Aiming to comprehensively address these issues above, our method fuses spatial and temporal features, and learns the mapping relationship from various features to salient regions using our multi-task learning framework. The final salient region is generated by our unified Bayesian framework. The experimental results demonstrated that our proposed approach outperforms previous methods.

Original language	English
Pages (from-to)	76-83
Number of pages	8
Journal	Pattern Recognition Letters
Volume	132
DOIs	https://doi.org/10.1016/j.patrec.2018.10.019
Publication status	Published - Apr 2020
Externally published	Yes

ASJC Scopus subject areas

Software
Signal Processing
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1016/j.patrec.2018.10.019

Cite this

@article{6fa89b8aa0c1471b94f483d517968f0a,

title = "Spatial–temporal multi-task learning for salient region detection",

abstract = "This paper proposes a novel multi-task learning based salient region detection method by fusing spatial and temporal features. Salient region detection has been widely used in various computer vision tasks, being as a general preprocessor to identify interest objects. Despite the recent successes, existing saliency models still lag behind the performance of human when visually perceives dynamic scenes. Most of the existing models largely rely on various spatial features. However, these spatial feature based methods have several deficiencies: (i) they can hardly adapt to the situation where moving objects are included, and (ii) they cannot model the human vision process in dynamic scenes. Recently, some saliency models introduce temporal features in their detecting process, such as the optical flow and stacking frames. The potential of temporal features for saliency optimization has been demonstrated. However, since temporal features in these models are merely used as a compensation to static features, the advantages of temporal features have not yet been fully explored. Aiming to comprehensively address these issues above, our method fuses spatial and temporal features, and learns the mapping relationship from various features to salient regions using our multi-task learning framework. The final salient region is generated by our unified Bayesian framework. The experimental results demonstrated that our proposed approach outperforms previous methods.",

author = "Zhe Chen and Ruili Wang and Ming Yu and Hongmin Gao and Qi Li and Huibin Wang",

note = "Publisher Copyright: {\textcopyright} 2018 Elsevier B.V.",

year = "2020",

month = apr,

doi = "10.1016/j.patrec.2018.10.019",

language = "English",

volume = "132",

pages = "76--83",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier",

}

TY - JOUR

T1 - Spatial–temporal multi-task learning for salient region detection

AU - Chen, Zhe

AU - Wang, Ruili

AU - Yu, Ming

AU - Gao, Hongmin

AU - Li, Qi

AU - Wang, Huibin

PY - 2020/4

Y1 - 2020/4

N2 - This paper proposes a novel multi-task learning based salient region detection method by fusing spatial and temporal features. Salient region detection has been widely used in various computer vision tasks, being as a general preprocessor to identify interest objects. Despite the recent successes, existing saliency models still lag behind the performance of human when visually perceives dynamic scenes. Most of the existing models largely rely on various spatial features. However, these spatial feature based methods have several deficiencies: (i) they can hardly adapt to the situation where moving objects are included, and (ii) they cannot model the human vision process in dynamic scenes. Recently, some saliency models introduce temporal features in their detecting process, such as the optical flow and stacking frames. The potential of temporal features for saliency optimization has been demonstrated. However, since temporal features in these models are merely used as a compensation to static features, the advantages of temporal features have not yet been fully explored. Aiming to comprehensively address these issues above, our method fuses spatial and temporal features, and learns the mapping relationship from various features to salient regions using our multi-task learning framework. The final salient region is generated by our unified Bayesian framework. The experimental results demonstrated that our proposed approach outperforms previous methods.

AB - This paper proposes a novel multi-task learning based salient region detection method by fusing spatial and temporal features. Salient region detection has been widely used in various computer vision tasks, being as a general preprocessor to identify interest objects. Despite the recent successes, existing saliency models still lag behind the performance of human when visually perceives dynamic scenes. Most of the existing models largely rely on various spatial features. However, these spatial feature based methods have several deficiencies: (i) they can hardly adapt to the situation where moving objects are included, and (ii) they cannot model the human vision process in dynamic scenes. Recently, some saliency models introduce temporal features in their detecting process, such as the optical flow and stacking frames. The potential of temporal features for saliency optimization has been demonstrated. However, since temporal features in these models are merely used as a compensation to static features, the advantages of temporal features have not yet been fully explored. Aiming to comprehensively address these issues above, our method fuses spatial and temporal features, and learns the mapping relationship from various features to salient regions using our multi-task learning framework. The final salient region is generated by our unified Bayesian framework. The experimental results demonstrated that our proposed approach outperforms previous methods.

UR - http://www.scopus.com/inward/record.url?scp=85055480497&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2018.10.019

DO - 10.1016/j.patrec.2018.10.019

M3 - Article

AN - SCOPUS:85055480497

SN - 0167-8655

VL - 132

SP - 76

EP - 83

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

ER -

Spatial–temporal multi-task learning for salient region detection

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this