{"title":"基于边缘设备的视频重复动作计数优化","authors":"Hyunwoo Yu, Yubin Cho, Jong Pil Yun, S. Kang","doi":"10.1109/ITC-CSCC58803.2023.10212477","DOIUrl":null,"url":null,"abstract":"Repetitive actions are prevalent in both natural and man-made environments, offering valuable insights into the analysis of action units and underlying phenomena. Video repetition counting task aims to predict the count and frequency of the repetitive actions. Deep learning models have been developed for this task, enabling the recognition of repetitive motions without physical contact with the moving object. However, these models often perform unnecessary operations during inference due to inefficient data pre-processing. To address this issue, we propose an optimized data frame pre-processing method that minimizes redundant operations, ensuring fast and accurate inference. Furthermore, in order to enable video repetition counting on edge devices, we employ quantization for model compression, allowing the deployment of lightweight models suitable for various applications.","PeriodicalId":220939,"journal":{"name":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimization of Video Repetitive Action Counting for Efficient Inference on Edge Devices\",\"authors\":\"Hyunwoo Yu, Yubin Cho, Jong Pil Yun, S. Kang\",\"doi\":\"10.1109/ITC-CSCC58803.2023.10212477\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Repetitive actions are prevalent in both natural and man-made environments, offering valuable insights into the analysis of action units and underlying phenomena. Video repetition counting task aims to predict the count and frequency of the repetitive actions. Deep learning models have been developed for this task, enabling the recognition of repetitive motions without physical contact with the moving object. However, these models often perform unnecessary operations during inference due to inefficient data pre-processing. To address this issue, we propose an optimized data frame pre-processing method that minimizes redundant operations, ensuring fast and accurate inference. Furthermore, in order to enable video repetition counting on edge devices, we employ quantization for model compression, allowing the deployment of lightweight models suitable for various applications.\",\"PeriodicalId\":220939,\"journal\":{\"name\":\"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITC-CSCC58803.2023.10212477\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-CSCC58803.2023.10212477","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimization of Video Repetitive Action Counting for Efficient Inference on Edge Devices
Repetitive actions are prevalent in both natural and man-made environments, offering valuable insights into the analysis of action units and underlying phenomena. Video repetition counting task aims to predict the count and frequency of the repetitive actions. Deep learning models have been developed for this task, enabling the recognition of repetitive motions without physical contact with the moving object. However, these models often perform unnecessary operations during inference due to inefficient data pre-processing. To address this issue, we propose an optimized data frame pre-processing method that minimizes redundant operations, ensuring fast and accurate inference. Furthermore, in order to enable video repetition counting on edge devices, we employ quantization for model compression, allowing the deployment of lightweight models suitable for various applications.