{"title":"Key frame extraction for falling detection","authors":"Jing Du, Yale Zhao, Shanna Zhuang, Zhengyou Wang","doi":"10.1109/ICITBE54178.2021.00032","DOIUrl":null,"url":null,"abstract":"The continuous advancement of computer vision and deep learning technology provides powerful technical support for supervising the occurrence of falling behaviors. Considering that some frames in the video contribute little to the recognition of falling behaviors, in order to eliminate the video frames irrelevant to falling behaviors, this paper proposes an algorithm to extract key frames in a video by combining LUV local maximum and Mask-RCNN. Firstly, candidate key frames are extracted based on the local maximum of LUV. Video frames containing motion changes can be obtained. Afterwards, in order to further eliminate the video frames that are less relevant to the fall action, Mask-RCNN is used for human body detection. According to the aspect ratio of the bounding box and the motion speed of the human body, the video frames that are more relevant to the fall action are selected as key frames. Experiments and results analysis are carried out on the UR fall detection dataset, Multiple cameras fall dataset, Le2i Fall detection dataset and falling video dataset of real scenes. The effectiveness and accuracy of the proposed method are verified.","PeriodicalId":207276,"journal":{"name":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","volume":"15 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information Technology and Biomedical Engineering (ICITBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITBE54178.2021.00032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
The continuous advancement of computer vision and deep learning provides powerful technical support for monitoring the occurrence of falling behaviors. Considering that some frames in a video contribute little to the recognition of falls, and in order to eliminate video frames irrelevant to falling behaviors, this paper proposes an algorithm that extracts key frames from a video by combining LUV local maxima with Mask R-CNN. First, candidate key frames are extracted based on local maxima in the LUV color space, yielding video frames that contain motion changes. Then, to further eliminate video frames that are less relevant to the fall action, Mask R-CNN is used for human body detection, and the frames most relevant to the fall action are selected as key frames according to the aspect ratio of the bounding box and the motion speed of the human body. Experiments and results analysis on the UR Fall Detection dataset, the Multiple Cameras Fall dataset, the Le2i Fall Detection dataset, and a falling-video dataset of real scenes verify the effectiveness and accuracy of the proposed method.
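
The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The version below assumes the LUV stage works on mean inter-frame differences in the LUV color space and that person detection uses torchvision's pre-trained Mask R-CNN; the aspect-ratio and motion-speed thresholds are hypothetical placeholders, not the parameters used in the paper.

```python
# Hedged sketch of the two-stage key-frame extraction described in the abstract.
# Assumptions (not from the paper): OpenCV for the LUV conversion, torchvision's
# pre-trained Mask R-CNN for person detection, and illustrative thresholds.
import cv2
import numpy as np
import torch
import torchvision


def luv_candidate_frames(frames):
    """Stage 1: keep frames whose LUV inter-frame difference is a local maximum."""
    diffs = []
    prev = cv2.cvtColor(frames[0], cv2.COLOR_BGR2LUV).astype(np.float32)
    for f in frames[1:]:
        cur = cv2.cvtColor(f, cv2.COLOR_BGR2LUV).astype(np.float32)
        diffs.append(float(np.abs(cur - prev).mean()))  # mean LUV change vs. previous frame
        prev = cur
    # a frame is a candidate if its difference exceeds both of its neighbours
    return [i + 1 for i in range(1, len(diffs) - 1)
            if diffs[i] > diffs[i - 1] and diffs[i] > diffs[i + 1]]


def fall_key_frames(frames, candidates, ratio_thr=1.0, speed_thr=15.0):
    """Stage 2: among candidates, keep frames whose detected person looks fall-like."""
    model = torchvision.models.detection.maskrcnn_resnet50_fpn(pretrained=True).eval()
    prev_center, key_frames = None, []
    for idx in candidates:
        # BGR (OpenCV) -> RGB tensor in [0, 1], as expected by torchvision detectors
        img = torch.from_numpy(frames[idx][:, :, ::-1].copy()).permute(2, 0, 1).float() / 255
        with torch.no_grad():
            det = model([img])[0]
        persons = [b for b, l, s in zip(det["boxes"], det["labels"], det["scores"])
                   if l.item() == 1 and s.item() > 0.8]  # COCO class 1 = person
        if not persons:
            continue
        x1, y1, x2, y2 = persons[0].tolist()
        aspect = (x2 - x1) / max(y2 - y1, 1e-6)            # bounding-box width / height
        center = np.array([(x1 + x2) / 2, (y1 + y2) / 2])
        speed = 0.0 if prev_center is None else float(np.linalg.norm(center - prev_center))
        prev_center = center
        # wide box (body near horizontal) or fast centre motion -> fall-relevant frame
        if aspect > ratio_thr or speed > speed_thr:
            key_frames.append(idx)
    return key_frames
```

In this sketch, stage 1 plays the role of the LUV local-maximum filter (frames with pronounced motion changes become candidates), and stage 2 plays the role of the Mask R-CNN refinement that uses bounding-box aspect ratio and body motion speed to retain fall-relevant key frames.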