TY - JOUR
T1 - Distributional Reinforcement Learning Based Integrated Decision Making and Control for Autonomous Surface Vehicles
AU - Lin, Xi
AU - Szenher, Paul
AU - Huang, Yewei
AU - Englot, Brendan
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2024
Y1 - 2024
N2 - With the growing demands for Autonomous Surface Vehicles (ASVs) in recent years, the number of ASVs being deployed for various maritime missions is expected to increase rapidly in the near future. However, it is still challenging for ASVs to perform sensor-based autonomous navigation in obstacle-filled and congested waterways, where perception errors, closely gathered vehicles and limited maneuvering space near buoys may cause difficulties in following the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). To address these issues, we propose a novel Distributional Reinforcement Learning based navigation system that can work with onboard LiDAR and odometry sensors to generate arbitrary thrust commands in continuous action space. Comprehensive evaluations of the proposed system in high-fidelity Gazebo simulations show its ability to decide whether to follow COLREGs or take other beneficial actions based on the scenarios encountered, offering superior performance in navigation safety and efficiency compared to systems using state-of-the-art Distributional RL, non-Distributional RL and classical methods.
AB - With the growing demands for Autonomous Surface Vehicles (ASVs) in recent years, the number of ASVs being deployed for various maritime missions is expected to increase rapidly in the near future. However, it is still challenging for ASVs to perform sensor-based autonomous navigation in obstacle-filled and congested waterways, where perception errors, closely gathered vehicles and limited maneuvering space near buoys may cause difficulties in following the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). To address these issues, we propose a novel Distributional Reinforcement Learning based navigation system that can work with onboard LiDAR and odometry sensors to generate arbitrary thrust commands in continuous action space. Comprehensive evaluations of the proposed system in high-fidelity Gazebo simulations show its ability to decide whether to follow COLREGs or take other beneficial actions based on the scenarios encountered, offering superior performance in navigation safety and efficiency compared to systems using state-of-the-art Distributional RL, non-Distributional RL and classical methods.
KW - Autonomous Vehicle Navigation
KW - Marine Robotics
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85212968130&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85212968130&partnerID=8YFLogxK
U2 - 10.1109/LRA.2024.3518839
DO - 10.1109/LRA.2024.3518839
M3 - Article
AN - SCOPUS:85212968130
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
ER -