Electric Power ›› 2023, Vol. 56 ›› Issue (9): 127-133.DOI: 10.11930/j.issn.1004-9649.202210068

• Distribution Network Planning and Optimized Operation • Previous Articles     Next Articles

Adversarial Reinforcement Learning-Based Converged Communication Efficiency Improvement Method for Power Distribution Network

PENG Linyu1, LIU Xu1, TANG Wei1, LIU Qing1, FANG Hao2, ZHANG Guanghui3   

  1. 1. Guizhou Power Grid Co., Ltd., Guiyang 550002, China;
    2. Guiyang Power Supply Bureau of Guizhou Power Grid Co., Ltd., Guiyang 550004, China;
    3. Guizhou Power Dispatching & Communication Center, Guiyang 550002, China
  • Received:2022-10-18 Revised:2023-08-10 Accepted:2023-01-16 Online:2023-09-23 Published:2023-09-28
  • Supported by:
    This work is supported by Science and Technology Project of China Southern Power Grid Corporation (No.066500GS62200017).

Abstract: In order to satisfy the diversified communication requirements of terminal source nodes in power distribution network, it is necessary to optimize the communication orchestration in power distribution unified communication network. Firstly, we construct the joint optimization problem of data transmission delay and energy consumption. Then, the joint optimization problem is modeled as a multi-armed bandit problem, and an adversarial reinforcement learning-based communication orchestration algorithm for power distribution unified communication network is proposed, which uses the historical orchestration information and the perceived adversary between source nodes to dynamically learn the communication orchestration strategy. Finally, the superior performance of the proposed algorithm is verified through simulation.

Key words: distribution network, reinforcement learning, adversary awareness, communication orchestration