TY - JOUR
T1 - Real-Time Optimal Power Flow Using Twin Delayed Deep Deterministic Policy Gradient Algorithm
AU - Woo, Jong Ha
AU - Wu, Lei
AU - Park, Jong Bae
AU - Roh, Jae Hyung
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2020
Y1 - 2020
N2 - The general concept of AC Optimal Power Flow (ACOPF) refers to the economic dispatch planning under electric network constraints. Moreover, each instance with the entire network must be solved in real-Time (i.e., every five minutes) to ensure cost-effective power system operation while satisfying power balance equation. As the operation of power systems penetrated with intermittent renewable energy becomes more complicated, this article proposes Deep Neural Network (DNN) and Levenberg-Marquardt backpropagation-based Twin Delayed Deep Deterministic Policy Gradient (TD3) approach to improve computational performance of ACOPF. Specifically, because the ACOPF model shall consider prevailing constraints of the power system, including power balance equation, we set the appropriate reward vector in the training process to build our own policy. Furthermore, we add random Gaussian noise to individual net loads for representing uncertainty characteristics introduced by renewable energy sources. Finally, the proposed model is compared with the MAT-POWER solution on the IEEE 118-bus system to demonstrate its efficacy and robustness.
AB - The general concept of AC Optimal Power Flow (ACOPF) refers to the economic dispatch planning under electric network constraints. Moreover, each instance with the entire network must be solved in real-Time (i.e., every five minutes) to ensure cost-effective power system operation while satisfying power balance equation. As the operation of power systems penetrated with intermittent renewable energy becomes more complicated, this article proposes Deep Neural Network (DNN) and Levenberg-Marquardt backpropagation-based Twin Delayed Deep Deterministic Policy Gradient (TD3) approach to improve computational performance of ACOPF. Specifically, because the ACOPF model shall consider prevailing constraints of the power system, including power balance equation, we set the appropriate reward vector in the training process to build our own policy. Furthermore, we add random Gaussian noise to individual net loads for representing uncertainty characteristics introduced by renewable energy sources. Finally, the proposed model is compared with the MAT-POWER solution on the IEEE 118-bus system to demonstrate its efficacy and robustness.
KW - Deep deterministic policy gradient
KW - Levenberg Marquardt
KW - deep reinforcement learning
KW - optimal power flow
KW - twin delayed deep deterministic policy gradient
UR - http://www.scopus.com/inward/record.url?scp=85097881615&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85097881615&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2020.3041007
DO - 10.1109/ACCESS.2020.3041007
M3 - Article
AN - SCOPUS:85097881615
VL - 8
SP - 213611
EP - 213618
JO - IEEE Access
JF - IEEE Access
M1 - 9272783
ER -