{"title":"利用音频/视频处理技术自动识别鸟类","authors":"Nikitha Sharma, Aditi Vijayeendra, Vishnu Gopakumar, Prakhar Patni, Ashwini Bhat","doi":"10.1109/ICONAT53423.2022.9725906","DOIUrl":null,"url":null,"abstract":"There are about 10,000 to 13,000 different species of birds in the world. Identification of bird species has been a taxing ordeal for ornithologists and domain experts for decades. Hence, automation of bird species classification will greatly help in enhancing ecological surveys. This paper presents a method to automatically identify bird species from a video recording of the bird by applying image and audio processing and classification techniques. The image and audio classification models, built using pre-trained neural networks - ResNet50V2 and EfficientNetB0, are trained and tested on an image and audio dataset containing 137 bird species. The datasets were curated using multiple data sources to expand the reach of the proposed model. The test accuracy rates of the two models were 97.1% and 92.4% respectively with a final overall model accuracy of 90%.","PeriodicalId":377501,"journal":{"name":"2022 International Conference for Advancement in Technology (ICONAT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Automatic Identification of Bird Species using Audio/Video Processing\",\"authors\":\"Nikitha Sharma, Aditi Vijayeendra, Vishnu Gopakumar, Prakhar Patni, Ashwini Bhat\",\"doi\":\"10.1109/ICONAT53423.2022.9725906\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are about 10,000 to 13,000 different species of birds in the world. Identification of bird species has been a taxing ordeal for ornithologists and domain experts for decades. Hence, automation of bird species classification will greatly help in enhancing ecological surveys. This paper presents a method to automatically identify bird species from a video recording of the bird by applying image and audio processing and classification techniques. The image and audio classification models, built using pre-trained neural networks - ResNet50V2 and EfficientNetB0, are trained and tested on an image and audio dataset containing 137 bird species. The datasets were curated using multiple data sources to expand the reach of the proposed model. The test accuracy rates of the two models were 97.1% and 92.4% respectively with a final overall model accuracy of 90%.\",\"PeriodicalId\":377501,\"journal\":{\"name\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICONAT53423.2022.9725906\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference for Advancement in Technology (ICONAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONAT53423.2022.9725906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Identification of Bird Species using Audio/Video Processing
There are about 10,000 to 13,000 different species of birds in the world. Identification of bird species has been a taxing ordeal for ornithologists and domain experts for decades. Hence, automation of bird species classification will greatly help in enhancing ecological surveys. This paper presents a method to automatically identify bird species from a video recording of the bird by applying image and audio processing and classification techniques. The image and audio classification models, built using pre-trained neural networks - ResNet50V2 and EfficientNetB0, are trained and tested on an image and audio dataset containing 137 bird species. The datasets were curated using multiple data sources to expand the reach of the proposed model. The test accuracy rates of the two models were 97.1% and 92.4% respectively with a final overall model accuracy of 90%.