Mohammad Mustafa Sa'doun, C. Lippitt, Gernot Paulus, Karl-Heinrich Anders
{"title":"复杂环境中水禽自动检测与识别的卷积神经网络体系结构比较","authors":"Mohammad Mustafa Sa'doun, C. Lippitt, Gernot Paulus, Karl-Heinrich Anders","doi":"10.1553/giscience2021_02_s152","DOIUrl":null,"url":null,"abstract":"Waterfowl monitoring is an important task for understanding waterfowl distribution and habitats. Surveying approaches using hyper-spatial airborne imagery, collected by small unoccupied aerial systems (sUAS), hold potential to overcome the limitations of traditional methods while improving count efficiency and reliability. Difficulties obtaining waterfowl counts, particularly in complex image scenes, from the high quantity of imagery required hinders deployment of large-scale surveys. In this paper, we test Convolutional Neural Networks (CNNs) to understand their potential and how they behave across different versions of our waterfowl dataset. Three CNN architectures (YOLO, Retinanet and Faster RCNN) were trained on 3 hierarchical levels: waterfowl detection (True / False), waterfowl type (3 classes), and waterfowl species (8 classes). The architectures generally performed well, and results indicate that automated waterfowl detection in complex environments, and therefore enumeration, is feasible using current technology. Waterfowl identification in complex environments was not successful using the available training data, but we propose steps that might enhance the results.","PeriodicalId":29645,"journal":{"name":"GI_Forum","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comparison of Convolutional Neural Network Architectures for Automated Detection and Identification of Waterfowl in Complex Environments\",\"authors\":\"Mohammad Mustafa Sa'doun, C. Lippitt, Gernot Paulus, Karl-Heinrich Anders\",\"doi\":\"10.1553/giscience2021_02_s152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Waterfowl monitoring is an important task for understanding waterfowl distribution and habitats. Surveying approaches using hyper-spatial airborne imagery, collected by small unoccupied aerial systems (sUAS), hold potential to overcome the limitations of traditional methods while improving count efficiency and reliability. Difficulties obtaining waterfowl counts, particularly in complex image scenes, from the high quantity of imagery required hinders deployment of large-scale surveys. In this paper, we test Convolutional Neural Networks (CNNs) to understand their potential and how they behave across different versions of our waterfowl dataset. Three CNN architectures (YOLO, Retinanet and Faster RCNN) were trained on 3 hierarchical levels: waterfowl detection (True / False), waterfowl type (3 classes), and waterfowl species (8 classes). The architectures generally performed well, and results indicate that automated waterfowl detection in complex environments, and therefore enumeration, is feasible using current technology. Waterfowl identification in complex environments was not successful using the available training data, but we propose steps that might enhance the results.\",\"PeriodicalId\":29645,\"journal\":{\"name\":\"GI_Forum\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GI_Forum\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1553/giscience2021_02_s152\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GI_Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1553/giscience2021_02_s152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
A Comparison of Convolutional Neural Network Architectures for Automated Detection and Identification of Waterfowl in Complex Environments
Waterfowl monitoring is an important task for understanding waterfowl distribution and habitats. Surveying approaches using hyper-spatial airborne imagery, collected by small unoccupied aerial systems (sUAS), hold potential to overcome the limitations of traditional methods while improving count efficiency and reliability. Difficulties obtaining waterfowl counts, particularly in complex image scenes, from the high quantity of imagery required hinders deployment of large-scale surveys. In this paper, we test Convolutional Neural Networks (CNNs) to understand their potential and how they behave across different versions of our waterfowl dataset. Three CNN architectures (YOLO, Retinanet and Faster RCNN) were trained on 3 hierarchical levels: waterfowl detection (True / False), waterfowl type (3 classes), and waterfowl species (8 classes). The architectures generally performed well, and results indicate that automated waterfowl detection in complex environments, and therefore enumeration, is feasible using current technology. Waterfowl identification in complex environments was not successful using the available training data, but we propose steps that might enhance the results.