TY - GEN
T1 - Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
AU - Lin, Xi
AU - Szenher, Paul
AU - Martin, John D.
AU - Englot, Brendan
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Route planning is essential to mobile robot navigation problems. In recent years, deep reinforcement learning (DRL) has been applied to learning optimal planning policies in stochastic environments without prior knowledge. However, existing works focus on learning policies that maximize the expected return, the performance of which can vary greatly when the level of stochasticity in the environment is high. In this work, we propose a distributional reinforcement learning based framework that learns return distributions which explicitly reflect environmental stochasticity. Policies based on the second-order stochastic dominance (SSD) relation can be used to make adjustable route decisions according to user preference on performance robustness. Our proposed method is evaluated in a simulated road network environment, and experimental results show that our method is able to plan the shortest routes that minimize stochasticity in travel time when robustness is preferred, while other state-of-the-art DRL methods are agnostic to environmental stochasticity.
AB - Route planning is essential to mobile robot navigation problems. In recent years, deep reinforcement learning (DRL) has been applied to learning optimal planning policies in stochastic environments without prior knowledge. However, existing works focus on learning policies that maximize the expected return, the performance of which can vary greatly when the level of stochasticity in the environment is high. In this work, we propose a distributional reinforcement learning based framework that learns return distributions which explicitly reflect environmental stochasticity. Policies based on the second-order stochastic dominance (SSD) relation can be used to make adjustable route decisions according to user preference on performance robustness. Our proposed method is evaluated in a simulated road network environment, and experimental results show that our method is able to plan the shortest routes that minimize stochasticity in travel time when robustness is preferred, while other state-of-the-art DRL methods are agnostic to environmental stochasticity.
UR - http://www.scopus.com/inward/record.url?scp=85169450732&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169450732&partnerID=8YFLogxK
U2 - 10.1109/UR57808.2023.10202222
DO - 10.1109/UR57808.2023.10202222
M3 - Conference contribution
AN - SCOPUS:85169450732
T3 - 2023 20th International Conference on Ubiquitous Robots, UR 2023
SP - 287
EP - 294
BT - 2023 20th International Conference on Ubiquitous Robots, UR 2023
T2 - 20th International Conference on Ubiquitous Robots, UR 2023
Y2 - 25 June 2023 through 28 June 2023
ER -