Electric Power ›› 2023, Vol. 56 ›› Issue (2): 68-76.DOI: 10.11930/j.issn.1004-9649.202107065

• Power System • Previous Articles     Next Articles

Deep Deterministic Policy Gradient Algorithm Based Wind-Photovoltaic-Storage Hybrid System Joint Dispatch

ZHANG Shuxing1, MA Chi2, YANG Zhixue3, WANG Yao1, WU Hao1, REN Zhouyang3   

  1. 1. China Nuclear Power Technology Research Institute Co., Ltd., Shenzhen 518000, China;
    2. CGN New Energy Holdings Co., Ltd., Beijing 100084, China;
    3. State Key Laboratory of Power Transmission Equipment & System Security and New Technology (Chongqing University), Chongqing 400044, China
  • Received:2021-08-10 Revised:2022-12-16 Accepted:2021-11-08 Online:2023-02-23 Published:2023-02-28
  • Supported by:
    This work is supported by National Natural Science Foundation of China (No.51677012).

Abstract: A deep reinforcement learning based wind-photovoltaic-storage system joint dispatch model is proposed. First, a joint dispatch model that fully considers the constraints of various wind and solar storage stations is established, where tracking dispatch plans, wind and solar curtailment, and energy storage operation costs are considered in the objective function. Then, the state variables, action variables and reward function under the reinforcement learning framework are defined. Later, a deep deterministic policy gradient algorithm is introduced, using its environmental interaction and strategy exploration mechanism to learn the joint scheduling strategy, so as to achieve the dispatch strategy tracking, reduce wind and solar abandonment, and reduce energy storage charging and discharging. Finally, the historical data of wind power, photovoltaic, and dispatch plan in a certain area of northwestern China are employed to train and analyze the model. The results of the case studies show that the proposed method can adapt well to the changes in the wind power and photovoltaic power in different periods, and the joint scheduling strategy can be obtained under given data of wind and photovoltaic.

Key words: wind-photovoltaic-storage hybrid system, joint scheduling strategy, uncertainty, deep reinforcement learning, deep deterministic policy gradient algorithm