Electric Power ›› 2024, Vol. 57 ›› Issue (3): 43-50.DOI: 10.11930/j.issn.1004-9649.202311065

• New Type Distribution Network Driven by Digital Technology • Previous Articles     Next Articles

Coordinated Optimization of Active and Reactive Power of Active Distribution Network Based on Safety Reinforcement Learning

Hao JIAO1(), Yanyan YIN2(), Chen WU3(), Jian LIU1, Chunlei XU3, Xian XU3, Guoqiang SUN2   

  1. 1. State Grid Jiangsu Electric Power Science Research Institute, Nanjing 211103, China
    2. College of Electrical and Power Engineering, Hohai University, Nanjing 211100, China
    3. State Grid Jiangsu Electric Power Co., Ltd., Nanjing 210024, China
  • Received:2023-11-15 Accepted:2024-02-13 Online:2024-03-23 Published:2024-03-28
  • Supported by:
    This work is supported by National Natural Science Foundation of China (No.U1966205) and Science and Technology Project of State Grid Jiangsu Electric Power Co., Ltd. (No.J2023121).

Abstract:

A safe reinforcement learning method based on offline strategies is proposed. Through offline training of a large amount of historical operating data of the distribution network, it gets rid of the traditional optimization method. Dependence on complete and accurate models. First, combined with the distribution network parameter information, an active and reactive power optimization model based on the constrained Markov decision process (CMDP) was established; then, a new safety reinforcement learning method was designed based on the original dual optimization method. The cost function is minimized while maximizing future discount rewards; finally, simulations are performed on power distribution system. The simulation results show that the proposed method can online generate a dispatching strategy that satisfies complex constraints and has economic benefits based on real-time observation information of the distribution network.

Key words: active distribution network, active and reactive power coordination optimization, safety reinforcement learning