TY - GEN
T1 - Control of shared energy storage assets within building clusters using reinforcement learning
AU - Odonkor, Philip
AU - Lewis, Kemper
N1 - Publisher Copyright:
Copyright © 2018 ASME.
PY - 2018
Y1 - 2018
N2 - This work leverages the current state of the art in reinforcement learning for continuous control, the Deep Deterministic Policy Gradient (DDPG) algorithm, towards the optimal 24-hour dispatch of shared energy assets within building clusters. The modeled DDPG agent interacts with a battery environment, designed to emulate a shared battery system. The aim here is to not only learn an efficient charged/discharged policy, but to also address the continuous domain question of how much energy should be charged or discharged. Experimentally, we examine the impact of the learned dispatch strategy towards minimizing demand peaks within the building cluster. Our results show that across the variety of building cluster combinations studied, the algorithm is able to learn and exploit energy arbitrage, tailoring it into battery dispatch strategies for peak demand shifting.
AB - This work leverages the current state of the art in reinforcement learning for continuous control, the Deep Deterministic Policy Gradient (DDPG) algorithm, towards the optimal 24-hour dispatch of shared energy assets within building clusters. The modeled DDPG agent interacts with a battery environment, designed to emulate a shared battery system. The aim here is to not only learn an efficient charged/discharged policy, but to also address the continuous domain question of how much energy should be charged or discharged. Experimentally, we examine the impact of the learned dispatch strategy towards minimizing demand peaks within the building cluster. Our results show that across the variety of building cluster combinations studied, the algorithm is able to learn and exploit energy arbitrage, tailoring it into battery dispatch strategies for peak demand shifting.
KW - Battery storage
KW - Building cluster
KW - Deep deterministic policy gradients
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85057064438&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057064438&partnerID=8YFLogxK
U2 - 10.1115/DETC2018-86094
DO - 10.1115/DETC2018-86094
M3 - Conference contribution
AN - SCOPUS:85057064438
T3 - Proceedings of the ASME Design Engineering Technical Conference
BT - 44th Design Automation Conference
T2 - ASME 2018 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC/CIE 2018
Y2 - 26 August 2018 through 29 August 2018
ER -