TY - GEN
T1 - Application of Actor-Critic Deep Reinforcement Learning Method for Obstacle Avoidance of WMR
AU - Gao, Xiaoshan
AU - Yan, Liang
AU - Wang, Gang
AU - He, Zhuang
AU - Gerada, Chris
AU - Chang, Suokui
N1 - Publisher Copyright:
© 2022, The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - A state-of-the-art framework, i.e., deep deterministic policy gradient (DDPG), has achieved notable results in the field of robotic control. When a wheeled mobile robot (WMR) operates in an unstructured environment, it is critical to endow the WMR with the capacity to avoid both static and dynamic obstacles. Thus, an obstacle avoidance algorithm based on DDPG is proposed to realize autonomous navigation in unknown environments. The WMR in this study is equipped with the requisite sensors to provide fully observable environment information at any moment. A continuous state space description for the WMR and obstacles is designed, together with the reward mechanism and action space. The learning agent, i.e., the studied mobile robot, utilizes the DDPG model; through continuous interaction with the surrounding environment and the application of historical experience data, the WMR can learn the optimal action behavior. Simulation and experimental results strongly verify the collision-free ability in static and dynamic scenarios with multiple observable obstacles.
AB - A state-of-the-art framework, i.e., deep deterministic policy gradient (DDPG), has achieved notable results in the field of robotic control. When a wheeled mobile robot (WMR) operates in an unstructured environment, it is critical to endow the WMR with the capacity to avoid both static and dynamic obstacles. Thus, an obstacle avoidance algorithm based on DDPG is proposed to realize autonomous navigation in unknown environments. The WMR in this study is equipped with the requisite sensors to provide fully observable environment information at any moment. A continuous state space description for the WMR and obstacles is designed, together with the reward mechanism and action space. The learning agent, i.e., the studied mobile robot, utilizes the DDPG model; through continuous interaction with the surrounding environment and the application of historical experience data, the WMR can learn the optimal action behavior. Simulation and experimental results strongly verify the collision-free ability in static and dynamic scenarios with multiple observable obstacles.
KW - Deep reinforcement learning
KW - Obstacle avoidance
KW - Wheeled mobile robot
UR - http://www.scopus.com/inward/record.url?scp=85120634140&partnerID=8YFLogxK
U2 - 10.1007/978-981-15-8155-7_453
DO - 10.1007/978-981-15-8155-7_453
M3 - Conference contribution
AN - SCOPUS:85120634140
SN - 9789811581540
T3 - Lecture Notes in Electrical Engineering
SP - 5485
EP - 5494
BT - Advances in Guidance, Navigation and Control - Proceedings of 2020 International Conference on Guidance, Navigation and Control, ICGNC 2020
A2 - Yan, Liang
A2 - Duan, Haibin
A2 - Yu, Xiang
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Guidance, Navigation and Control, ICGNC 2020
Y2 - 23 October 2020 through 25 October 2020
ER -