面向水域人员的不安全行为识别算法中国测试科技资讯平台

作者：何赟泽¹, 周辉¹, 吴兴辉¹, 任丹彤¹, 丁美有¹, 程亮^2,3

作者单位：1. 湖南大学电气与信息工程学院, 湖南长沙 410006;
2. 江苏海洋大学海洋工程学院, 江苏连云港 222005;
3. 珠海云洲智能科技有限公司, 广东珠海 519085

关键词：水域环境;行为识别;水域人员动作行为数据集;SlowFast算法改进

摘要：

为保障水域人员生命安全，针对水域环境人员不安全行为识别问题，提出一种基于Faster-RCNN+改进SlowFast的水域人员行为识别网络。主要工作包括以下几点：第一：根据采集的95段视频制作水域人员动作行为数据集，其中共有161687张快通道帧，5543张关键帧，共计标注框9173个。第二：使用Faster-RCNN+SlowFast模块化设计对水域人员动作行为数据集进行目标定位及行为分析，其实验结果表现良好，可以识别水域人员位置及行为。第三：提出一种与时域压缩策略相结合方式的改进SlowFast网络，结果表明改进SlowFast网络模型识别准确率提高5.0%，推理速度得到约1.14倍提升，实验证明可有效帮助识别水域人员的行为识别问题。

Unsafe behavior recognition algorithm and application for water personnel
HE Yunze, ZHOU Hui, WU Xinghui, REN Dantong, DING Meiyou, CHENG Liang
1. College of Electrical and Information Engineering, Hunan University, Changsha 410006, China;
2. School of Ocean Engineering, Jiangsu Ocean University, Lianyungang 222005, China;
3. Zhuhai Yunzhou IntelligentTechnology Co., Ltd., Zhuhai 519085, China
Abstract: In order to ensure the life safety of water personnel, a behavior recognition network based on Faster-RCNN+SlowFast improved was proposed for unsafe behavior recognition of water personnel. The main work includes the following points: First, according to the 95 videos collected, the dataset of water area personnel's action behavior is constructed, in which there are 161687 fast channel frames, 5543 key frames and a total of 9173 labeling frames. Secondly, the Faster-RCNN+SlowFast modular design is used to conduct target location and behavior analysis on the dataset of personnel's action behavior in water area. The experimental results show good performance and can identify the location and behavior of personnel in water area. Thirdly, an improved SlowFast network combined with time-domain compression strategy is proposed. The results show that the recognition accuracy of the improved SlowFast network model increases by 5.0%, and the inference speed increases by about 1.14 times. The experiment proves that the improved network model can effectively help to identify the behavior recognition problms of people in water areas.
Keywords: water environment;behavior recognition;water personnel action behavior dataset;improvement of SlowFast
2023, 49(9):104-110 收稿日期: 2022-6-28;收到修改稿日期: 2022-9-2
基金项目:
作者简介: 何赟泽（1983-），男，山西祁县人，教授，主要研究方向为嵌入式人工智能与边缘计算、红外热成像与机器视觉。
参考文献
[1] 杨刚, 张宇姝, 宋震. 人体动作识别与评价——区别、联系及研究进展[J]. 计算机科学与探索, 2022, 16(5): 991-1007.
[2] 杜思远, 李杰, 郑涛, 等. 基于三子样更新的EKF低成本MEMS姿态估计算法[J]. 中国测试, 2018, 44(3): 38-43.
[3] 郭海艳, 程亮, 杨春利, 等. 面向水面无人艇的目标检测与船舶分类系统研究[J]. 中国测试, 2023, 49(6): 114-121.
[4] 黄晴晴, 周风余, 刘美珍. 基于视频的人体动作识别算法综述[J]. 计算机应用研究, 2020, 37(11): 3213-3219.
[5] 乔羽. 基于Mask R-CNN泳池中溺水行为检测系统的设计与实现[D]. 青岛: 青岛大学, 2019.
[6] FEICHTENHOFER C, FAN H, MALIK J, et al. Slowfast networks for video recognition[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.
[7] NIU J, YANG R, GUAN W, et al. Spatial-temporal graph convolutional networks for action recognition with adjacency matrix generation network[C]//2nd International Conference on Electronics, Communications and Information Technology (CECIT), 2021.
[8] HUYNH T T, HUA C H, TU N A, et al. Learning 3D spatiotemporal gait feature by convolutional network for person identification[J]. Neurocomputing, 2020, 397(3): 192-202.
[9] TAN D, HE C, WANG Y. Action recognition model based on feature interaction[C]//Proceedings of the 11th International Conference on Computer Engineering and Networks, 2021.
[10] ULLAH A, AHMAD J, MUHAMMAD K, et al. Action recognition in video sequences using deep bi-directional LSTM with CNN features[J]. IEEE access, 2017, 6(99): 1155-1166.
[11] 杨清山, 穆太江. 采用蒸馏训练的时空图卷积动作识别融合模型[J]. 中国图象图形学报, 2022, 27(4): 1290-1301.
[12] SIMONYAN K, ZISSERMAN A. Two-stream convolutional networks for action recognition in videos[J]. Advances in neural information processing systems, 2014, 27(1): 568-576.
[13] WU H, SONG C, YUE S, et al. Dynamic video mix-up for cross-domain action recognition[J]. Neurocomputing, 2022, 471(6): 358-368.
[14] GU C, SUN C, ROSS D A, et al. Ava: A video dataset of spatio-temporally localized atomic visual actions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
[15] 郭雨青, 曾庆军, 夏楠, 等. 图像增强水下自主机器人目标识别研究[J]. 中国测试, 2021, 47(11): 47-52.
[16] 彭伟康, 陈爱军, 吴东明, 等. 基于改进Faster R-CNN的水准泡缺陷检测方法[J]. 中国测试, 2021, 47(7): 6-12.

数字期刊群

6大专业栏目，满足更精准的内容需求

《中国测试》期刊

了解我们，实时跟进期刊出版

行业期刊

测试计量专业期刊论文数据库

资料下载

学习工具和知识点的集合库

视听课堂

中国测试独家视频

言论图书馆

行业参考书籍

学术会议

追踪学术研究热点

行业新闻

获取业界最新要闻

专家风采

关注学术大牛动向

杂志社动态

聚焦中国测试杂志社

科普课堂

用科学知识，引领智慧生活

专题页

用科学知识，引领智慧生活

科技情报

获取业界最新要闻

产业情报

关注学术大牛动向

面向水域人员的不安全行为识别算法

量值传递中绝对测量与相对测量转化实例的数理分析(一)

三维结构形变的单目像机测量方法

光纤陀螺标度因数与零偏测试及评价方法研究

声源定位系统校准研究与不确定度分析

低空慢速小目标探测与定位技术研究

一种相似性框架下基于非线性扩散过程的剩余寿命估计模型

免费

免费

免费

免费