Electric Power ›› 2025, Vol. 58 ›› Issue (5): 11-20, 32.DOI: 10.11930/j.issn.1004-9649.202408092

• Artificial Intelligence and New Energy Technologies for New Power Distribution Systems • Previous Articles     Next Articles

Two-layer Optimization Scheduling for Off-grid Microgrids Based on Multi-agent Deep Policy Gradient

FAN Huicong1(), DUAN Zhiguo2, CHEN Zhiyong1, ZHU Shijia1, LIU Hang3, LI Wenxiao1(), YANG Yang3   

  1. 1. Economic and Technological Research Institute, State Grid Hebei Electric Power Co., Ltd., Shijiazhuang 050000, China
    2. State Grid Hebei Electric Power Co., Ltd., Shijiazhuang 050000, China
    3. Handan Electric Power Supply Company, State Grid Hebei Electric Power Co., Ltd., Handan 056000, China
  • Received:2024-08-26 Online:2025-05-30 Published:2025-05-28
  • Supported by:
    This work is supported by the Science And Technology Project of SGCC (No.5400-202313823A-4-1-KJ).

Abstract:

To address the voltage limit violations and bidirectional power flow problems arising from high-penetration integration of distributed renewable energy, this paper proposes a two-layer active-reactive power cooperative optimization method to achieve cooperative optimal dispatch of active and reactive power in off-grid microgrids, ensuring the secure and stable operation of the system while enhancing operational economy. The lower-level model optimizes slow-regulating discrete devices based on mixed-integer second-order cone programming, while the upper-level model optimizes fast-regulating continuous devices using a multi-agent deep policy gradient algorithm. The two-layer model coordinates both active and reactive power flows of the microgrid, enabling real-time monitoring of the microgrid's status and online decision-making for the optimization of device regulation, without reliance on precise power flow models or complex communication systems. Finally, the feasibility and effectiveness of the two-layer optimization model are validated in the improved IEEE 33-bus microgrid system.

Key words: off-grid microgrid, multi-agent, deep reinforcement learning, mixed integer linear programming, multi-time scale, active-reactive power cooperative optimization