Abstract
Autonomous navigation of Unmanned Surface Vehicles (USV) in marine environments with current flows is challenging, and few prior works have addressed the sensor-based navigation problem in such environments under no prior knowledge of the current flow and obstacles. We propose a Distributional Reinforcement Learning (RL) based local path planner that learns return distributions which capture the uncertainty of action outcomes, and an adaptive algorithm that automatically tunes the level of sensitivity to the risk in the environment. The proposed planner achieves a more stable learning performance and converges to safer policies than a traditional RL based planner. Computational experiments demonstrate that comparing to a traditional RL based planner and classical local planning methods such as Artificial Potential Fields and the Bug Algorithm, the proposed planner is robust against environmental flows, and is able to plan trajectories that are superior in safety, time and energy consumption.
| Original language | English |
|---|---|
| Title of host publication | 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023 |
| Pages | 6185-6191 |
| Number of pages | 7 |
| ISBN (Electronic) | 9781665491907 |
| DOIs | |
| State | Published - 2023 |
| Event | 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023 - Detroit, United States Duration: 1 Oct 2023 → 5 Oct 2023 |
Publication series
| Name | IEEE International Conference on Intelligent Robots and Systems |
|---|---|
| ISSN (Print) | 2153-0858 |
| ISSN (Electronic) | 2153-0866 |
Conference
| Conference | 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2023 |
|---|---|
| Country/Territory | United States |
| City | Detroit |
| Period | 1/10/23 → 5/10/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
-
SDG 14 Life Below Water
Fingerprint
Dive into the research topics of 'Robust Unmanned Surface Vehicle Navigation with Distributional Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver