中国电力 ›› 2023, Vol. 56 ›› Issue (11): 77-85.DOI: 10.11930/j.issn.1004-9649.202305046

• 面向电网设备状态感知的低功耗无线传感网技术及应用 • 上一篇    下一篇

基于分组查询注意力的可扩展电力人员行为分类方法

周震震1(), 宋云海1, 何宇浩1, 王黎伟1, 黄和燕1, 何珏1, 朱志航2, 闫云凤2()   

  1. 1. 中国南方电网有限责任公司超高压输电公司检修试验中心,广东 广州 510663
    2. 浙江大学,浙江 杭州 310007
  • 收稿日期:2023-05-09 接受日期:2023-10-13 出版日期:2023-11-28 发布日期:2023-11-28
  • 作者简介:周震震(1986—),男,硕士,高级工程师,从事数字电网技术研究,E-mail: zhouzhenzhen@im.ehv.csg
    闫云凤(1988—),女,博士,从事计算机视觉技术研究,E-mail: 21210004@zju.edu.cn
  • 基金资助:
    浙江省自然科学基金资助项目(基于图像的电力异常检测关键技术研究与应用,LQ21F030017)。

Extensible Classification Method for Power Personnel Behavior Based on Pose Estimation

Zhenzhen ZHOU1(), Yunhai SONG1, Yuhao HE1, Liwei WANG1, Heyan HUANG1, Jue HE1, Zhihang ZHU2, Yunfeng YAN2()   

  1. 1. Overhaul and Test Center of UHV Transmission Company of China Southern Power Grid Co., Ltd., Guangzhou 510663, China
    2. Zhejiang University, Hangzhou 310007, China
  • Received:2023-05-09 Accepted:2023-10-13 Online:2023-11-28 Published:2023-11-28
  • Supported by:
    This work is supported by Natural Science Foundation of Zhejiang Province (Research and Application on Power Defects Detection Methods via Images, No.LQ21F030017).

摘要:

电力人员行为识别是电力系统安全运维的重要环节,现有的人员行为识别算法主要采用支持向量机和多层感知机进行行为分类,存在识别精度低、未考虑人体骨架之间交互关系、迁移性、通用性差等问题。针对上述问题,提出一种基于自注意力与交叉注意力机制的行为分类解码器,充分考虑了人体骨架之间的关联。其分类精度相比传统分类方法提升10%~20%,较深度学习多层感知机(multilayer perceptron,MLP)分类方法提升2%以上。该方法运用编码器-解码器架构的二阶段方法进行行为识别,使得解码器可以适用于任意姿态估计,网络后端具有很强的可扩展性。此外,采用分组解码的方式克服了注意力机制带来的二次方复杂度,使得该解码器可以扩展到更多行为类别,具有更好的普适性。该行为识别算法能够在基于变电站工作场景下的人员图像数据集验证中达到优异的识别效果,综合识别率达91.1%,验证了所提电力人员行为分类方法的有效性和适用性。

关键词: 姿态估计, 注意力机制, 行为识别, 深度学习

Abstract:

Power personnel behavior recognition is a critical component for the safe operation and maintenance of the power system. However, current personnel behavior recognition algorithms, which primarily rely on support vector machines and multi-layer perceptrons for behavior classification, have a number of shortcomings, including low recognition accuracy, insufficient consideration of the interactions between human skeletons, and poor mobility and universality. To address these challenges, we proposes a novel behavior classification decoder based on a self-attention and cross-attention mechanism, which fully considers the associations between human skeletons. Compared to the traditional classification methods, the proposed approach improves the classification accuracy by approximately 10%~20%, and outperforms the deep learning MLP classification methods by more than 2%. To implement behavior recognition, we use a two-stage encoder-decoder architecture method, which has good extensibility while making the decoder suitable for the back end of any pose estimation network. Additionally, we use a grouped decoding method to overcome the quadratic complexity induced by the attention mechanism, which enables the decoder to extend to include more behavior categories, thus being more universal. The proposed behavior recognition algorithm achieves the optimal recognition effect in the personnel image data set based on the substation working scenarios. The comprehensive recognition rate reaches 91.1%, which verifies the efficacy and practicality of the proposed power personnel behavior classification method.

Key words: pose estimation, attention mechanism, behavior recognition, deep learning