Wazha Mmereki, R. Jamisola, Dimane Mpoeleng, Tinao Petso
{"title":"基于yolov3的移动高空航拍人眼活动识别","authors":"Wazha Mmereki, R. Jamisola, Dimane Mpoeleng, Tinao Petso","doi":"10.1109/ICARA51699.2021.9376435","DOIUrl":null,"url":null,"abstract":"This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN‘s) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.","PeriodicalId":183788,"journal":{"name":"2021 7th International Conference on Automation, Robotics and Applications (ICARA)","volume":"191 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"YOLOv3-Based Human Activity Recognition as Viewed from a Moving High-Altitude Aerial Camera\",\"authors\":\"Wazha Mmereki, R. Jamisola, Dimane Mpoeleng, Tinao Petso\",\"doi\":\"10.1109/ICARA51699.2021.9376435\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN‘s) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.\",\"PeriodicalId\":183788,\"journal\":{\"name\":\"2021 7th International Conference on Automation, Robotics and Applications (ICARA)\",\"volume\":\"191 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th International Conference on Automation, Robotics and Applications (ICARA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICARA51699.2021.9376435\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Automation, Robotics and Applications (ICARA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARA51699.2021.9376435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
YOLOv3-Based Human Activity Recognition as Viewed from a Moving High-Altitude Aerial Camera
This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN‘s) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.