Action Recognition from Low-Resolution Infrared Sensor for Indoor use: A Comparative Study between Deep Learning and Classical Approaches

2019 20th IEEE International Conference on Mobile Data Management (MDM) Pub Date : 2019-06-10 DOI:10.1109/MDM.2019.00-11

Félix Polla, H. Laurent, B. Emile

{"title":"Action Recognition from Low-Resolution Infrared Sensor for Indoor use: A Comparative Study between Deep Learning and Classical Approaches","authors":"Félix Polla, H. Laurent, B. Emile","doi":"10.1109/MDM.2019.00-11","DOIUrl":null,"url":null,"abstract":"In recent years, automatic action recognition has attracted a lot of attention in the field of computer vision. In this paper, action recognition in an indoor environment using low-resolution sensor is considered. In this environment, several studies have been carried out in visible imagery. Despite impressive performances, the main pitfall faced by the techniques developed is a marked restraint of the users to be filmed. To cope with the problems of personal identity revealed during the surveillance, we opted within the CoCAPS project for using a low resolution (64 x 64 pixels) infrared sensor with ceiling position which guarantees the respect of the intimacy of the person. Defined in collaboration with the industrial partners involved in the CoCAPS project, the scenarios targeted concern office situations and 7 classes of action have been selected, namely: no action, restlessness, sitting down, standing up, turning sitting on a seat, slow walking, fast walking. Classical approaches such as those based on the computation of shape descriptors (such as geometric and Color Histogram of Oriented Phases (CHOP)) extracted from Motion History Image (MHI) are investigated to represent action video sequences. Within this first group of classical approaches, the performance of a proposed model based on statistical attributes constructed from the tracking of centers of gravity of segmented forms is also presented. The comparative study is then completed by considering other models extracted from deep learning literature (convolutional neural networks (3D-CNN), Long Short Term Memory (LSTM)). The results obtained from the comparative study show that the proposed model is very competitive and provides promising results (83% of f-score).","PeriodicalId":241426,"journal":{"name":"2019 20th IEEE International Conference on Mobile Data Management (MDM)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 20th IEEE International Conference on Mobile Data Management (MDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MDM.2019.00-11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In recent years, automatic action recognition has attracted a lot of attention in the field of computer vision. In this paper, action recognition in an indoor environment using low-resolution sensor is considered. In this environment, several studies have been carried out in visible imagery. Despite impressive performances, the main pitfall faced by the techniques developed is a marked restraint of the users to be filmed. To cope with the problems of personal identity revealed during the surveillance, we opted within the CoCAPS project for using a low resolution (64 x 64 pixels) infrared sensor with ceiling position which guarantees the respect of the intimacy of the person. Defined in collaboration with the industrial partners involved in the CoCAPS project, the scenarios targeted concern office situations and 7 classes of action have been selected, namely: no action, restlessness, sitting down, standing up, turning sitting on a seat, slow walking, fast walking. Classical approaches such as those based on the computation of shape descriptors (such as geometric and Color Histogram of Oriented Phases (CHOP)) extracted from Motion History Image (MHI) are investigated to represent action video sequences. Within this first group of classical approaches, the performance of a proposed model based on statistical attributes constructed from the tracking of centers of gravity of segmented forms is also presented. The comparative study is then completed by considering other models extracted from deep learning literature (convolutional neural networks (3D-CNN), Long Short Term Memory (LSTM)). The results obtained from the comparative study show that the proposed model is very competitive and provides promising results (83% of f-score).

查看原文本刊更多论文

室内低分辨率红外传感器的动作识别:深度学习与经典方法的比较研究

近年来，自动动作识别在计算机视觉领域引起了广泛的关注。本文研究了在室内环境下使用低分辨率传感器的动作识别问题。在这种环境下，已经对可见光图像进行了一些研究。尽管表现令人印象深刻，但所开发的技术面临的主要陷阱是对要拍摄的用户的明显限制。为了应对监控过程中暴露的个人身份问题，我们在CoCAPS项目中选择了低分辨率(64 x 64像素)的红外传感器，该传感器带有天花板位置，保证了对人的亲密性的尊重。与参与CoCAPS项目的工业合作伙伴合作定义，目标场景涉及办公室情况和7类动作，即:不行动，坐立不安，坐着，站起来，转身坐在座位上，慢走，快走。研究了从运动历史图像(MHI)中提取形状描述符(如定向相位的几何直方图和颜色直方图(CHOP))的经典方法来表示动作视频序列。在第一组经典方法中，提出了一种基于统计属性的模型的性能，该模型是由分段形式的重心跟踪构建的。然后通过考虑从深度学习文献中提取的其他模型(卷积神经网络(3D-CNN)，长短期记忆(LSTM))来完成比较研究。对比研究结果表明，所提出的模型具有很强的竞争力，并提供了有希望的结果(f-score的83%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 20th IEEE International Conference on Mobile Data Management (MDM)

自引率

0.00%

发文量