Methodological approaches and algorithms for recognizing and counting animals in aerial photographs

Q3 Mathematics

Informatsionno-Upravliaiushchie Sistemy Pub Date : 2021-10-26 DOI:10.31799/1684-8853-2021-5-20-32

V. Mikhailov, Vladislav Sobolevskii, L. Kolpaschikov, Nikolay V. Soloviev, Georgiy Yakushev

{"title":"Methodological approaches and algorithms for recognizing and counting animals in aerial photographs","authors":"V. Mikhailov, Vladislav Sobolevskii, L. Kolpaschikov, Nikolay V. Soloviev, Georgiy Yakushev","doi":"10.31799/1684-8853-2021-5-20-32","DOIUrl":null,"url":null,"abstract":"Introduction: The complexity of recognition and counting of objects in a photographic image is directly related to variability of related factors: physical difference of objects from the same class, presence of images similar to objects to be recognized, non-uniform background, change of shooting conditions and position of the objects when the photo was taken. Most challenging are the problems of identifying people in crowds, animals in natural environment, cars from surveillance cameras, objects of construction and infrastructure on aerial photo images, etc. These problems have their own specific factor space, but the methodological approaches to their solution are similar. Purpose: The development of methodologies and software implementations solving the problem of recognition and counting of objects with high variability, on the example of reindeer recognition in the natural environment. Methods: Two approaches are investigated: feature-based recognition based on binary pixel classification and reference-based recognition using convolutional neural networks. Results: Methodologies and programs have been developed for pixel-by-pixel recognition with subsequent binarization, image clustering and cluster counting and image recognition using the convolutional neural network of Mask R-CNN architecture. The network is first trained to recognize animals as a class from the array of MS COCO dataset images and then trained on the array of aerial photographs of reindeer herds. Analysis of the results shows that feature-based methods with pixel-by-pixel recognition give good results on relatively simple images (recognition error 10–15%). The presence of artifacts on the image that are close to the characteristics of the reindeer images leads to a significant increase in the error. The convolutional neural network showed higher accuracy, which on the test sample was 82%, with no false positives. Practical relevance: А software prototype has been created for the recognition system based on convolutional neural networks with a web interface, and the program itself has been put into limited operation.","PeriodicalId":36977,"journal":{"name":"Informatsionno-Upravliaiushchie Sistemy","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatsionno-Upravliaiushchie Sistemy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31799/1684-8853-2021-5-20-32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: The complexity of recognition and counting of objects in a photographic image is directly related to variability of related factors: physical difference of objects from the same class, presence of images similar to objects to be recognized, non-uniform background, change of shooting conditions and position of the objects when the photo was taken. Most challenging are the problems of identifying people in crowds, animals in natural environment, cars from surveillance cameras, objects of construction and infrastructure on aerial photo images, etc. These problems have their own specific factor space, but the methodological approaches to their solution are similar. Purpose: The development of methodologies and software implementations solving the problem of recognition and counting of objects with high variability, on the example of reindeer recognition in the natural environment. Methods: Two approaches are investigated: feature-based recognition based on binary pixel classification and reference-based recognition using convolutional neural networks. Results: Methodologies and programs have been developed for pixel-by-pixel recognition with subsequent binarization, image clustering and cluster counting and image recognition using the convolutional neural network of Mask R-CNN architecture. The network is first trained to recognize animals as a class from the array of MS COCO dataset images and then trained on the array of aerial photographs of reindeer herds. Analysis of the results shows that feature-based methods with pixel-by-pixel recognition give good results on relatively simple images (recognition error 10–15%). The presence of artifacts on the image that are close to the characteristics of the reindeer images leads to a significant increase in the error. The convolutional neural network showed higher accuracy, which on the test sample was 82%, with no false positives. Practical relevance: А software prototype has been created for the recognition system based on convolutional neural networks with a web interface, and the program itself has been put into limited operation.

查看原文本刊更多论文

航空照片中动物识别和计数的方法方法和算法

导论:摄影图像中物体识别和计数的复杂性与相关因素的可变性直接相关:同类物体的物理差异、存在与待识别物体相似的图像、背景不均匀、拍摄时拍摄条件和物体位置的变化。最具挑战性的问题是在人群中识别人、在自然环境中识别动物、在监控摄像头中识别汽车、在航拍图像中识别建筑和基础设施对象等。这些问题有其特定的因素空间，但解决它们的方法是相似的。目的:以自然环境中驯鹿的识别为例，研究解决高变异性物体的识别与计数问题的方法和软件实现。方法:研究了基于二值像素分类的特征识别方法和基于卷积神经网络的参考识别方法。结果:已经开发了用于逐像素识别的方法和程序，随后进行二值化，图像聚类和聚类计数以及使用Mask R-CNN架构的卷积神经网络进行图像识别。该网络首先被训练来识别MS COCO数据集图像阵列中的动物，然后在驯鹿群的航拍照片阵列上进行训练。结果分析表明，基于特征的逐像素识别方法对相对简单的图像具有较好的识别效果(识别误差为10-15%)。图像上存在与驯鹿图像特征接近的伪影，导致误差显著增加。卷积神经网络显示出更高的准确率，在测试样本上达到82%，没有假阳性。实际意义:基于卷积神经网络的识别系统已经创建了А软件原型，带有web界面，程序本身已经投入有限的运行。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Informatsionno-Upravliaiushchie Sistemy Mathematics-Control and Optimization

CiteScore

1.40

自引率

0.00%

发文量