Control of shared energy storage assets within building clusters using reinforcement learning

Philip Odonkor, Kemper Lewis

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    8 Scopus citations

    Abstract

    This work leverages the current state of the art in reinforcement learning for continuous control, the Deep Deterministic Policy Gradient (DDPG) algorithm, towards the optimal 24-hour dispatch of shared energy assets within building clusters. The modeled DDPG agent interacts with a battery environment, designed to emulate a shared battery system. The aim here is to not only learn an efficient charged/discharged policy, but to also address the continuous domain question of how much energy should be charged or discharged. Experimentally, we examine the impact of the learned dispatch strategy towards minimizing demand peaks within the building cluster. Our results show that across the variety of building cluster combinations studied, the algorithm is able to learn and exploit energy arbitrage, tailoring it into battery dispatch strategies for peak demand shifting.

    Original languageEnglish
    Title of host publication44th Design Automation Conference
    ISBN (Electronic)9780791851753
    DOIs
    StatePublished - 2018
    EventASME 2018 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC/CIE 2018 - Quebec City, Canada
    Duration: 26 Aug 201829 Aug 2018

    Publication series

    NameProceedings of the ASME Design Engineering Technical Conference
    Volume2A-2018

    Conference

    ConferenceASME 2018 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC/CIE 2018
    Country/TerritoryCanada
    CityQuebec City
    Period26/08/1829/08/18

    Keywords

    • Battery storage
    • Building cluster
    • Deep deterministic policy gradients
    • Reinforcement learning

    Fingerprint

    Dive into the research topics of 'Control of shared energy storage assets within building clusters using reinforcement learning'. Together they form a unique fingerprint.

    Cite this