Comparison of manual, machine learning, and hybrid methods for video annotation to extract parental care data

IF 1.5 3区 生物学 Q1 ORNITHOLOGY
Alex Hoi Hang Chan, Jingqi Liu, Terry Burke, William D. Pearse, Julia Schroeder
{"title":"Comparison of manual, machine learning, and hybrid methods for video annotation to extract parental care data","authors":"Alex Hoi Hang Chan,&nbsp;Jingqi Liu,&nbsp;Terry Burke,&nbsp;William D. Pearse,&nbsp;Julia Schroeder","doi":"10.1111/jav.03167","DOIUrl":null,"url":null,"abstract":"<p>Measuring parental care behaviour in the wild is central to the study of animal ecology and evolution, but it is often labour- and time-intensive. Efficient open-source tools have recently emerged that allow animal behaviour to be quantified from videos using machine learning and computer vision techniques, but there is limited appraisal of how these tools perform compared to traditional methods. To gain insight into how different methods perform in extracting data from videos taken in the field, we compared estimates of the parental provisioning rate of wild house sparrows <i>Passer domesticus</i> from video recordings. We compared four methods: manual annotation by experts, crowd-sourcing, automatic detection based on the open-source software DeepMeerkat, and a hybrid annotation method. We found that the data collected by the automatic method correlated with expert annotation (r = 0.62) and further show that these data are biologically meaningful as they predict brood survival. However, the automatic method produced largely biased estimates due to the detection of non-visitation events, while the crowd-sourcing and hybrid annotation produced estimates that are equivalent to expert annotation. The hybrid annotation method takes approximately 20% of annotation time compared to manual annotation, making it a more cost-effective way to collect data from videos. We provide a successful case study of how different approaches can be adopted and evaluated with a pre-existing dataset, to make informed decisions on the best way to process video datasets. If pre-existing frameworks produce biased estimates, we encourage researchers to adopt a hybrid approach of first using machine learning frameworks to preprocess videos, and then to do manual annotation to save annotation time. As open-source machine learning tools are becoming more accessible, we encourage biologists to make use of these tools to cut annotation time but still get equally accurate results without the need to develop novel algorithms from scratch.</p>","PeriodicalId":15278,"journal":{"name":"Journal of Avian Biology","volume":"2024 3-4","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jav.03167","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Avian Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jav.03167","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORNITHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Measuring parental care behaviour in the wild is central to the study of animal ecology and evolution, but it is often labour- and time-intensive. Efficient open-source tools have recently emerged that allow animal behaviour to be quantified from videos using machine learning and computer vision techniques, but there is limited appraisal of how these tools perform compared to traditional methods. To gain insight into how different methods perform in extracting data from videos taken in the field, we compared estimates of the parental provisioning rate of wild house sparrows Passer domesticus from video recordings. We compared four methods: manual annotation by experts, crowd-sourcing, automatic detection based on the open-source software DeepMeerkat, and a hybrid annotation method. We found that the data collected by the automatic method correlated with expert annotation (r = 0.62) and further show that these data are biologically meaningful as they predict brood survival. However, the automatic method produced largely biased estimates due to the detection of non-visitation events, while the crowd-sourcing and hybrid annotation produced estimates that are equivalent to expert annotation. The hybrid annotation method takes approximately 20% of annotation time compared to manual annotation, making it a more cost-effective way to collect data from videos. We provide a successful case study of how different approaches can be adopted and evaluated with a pre-existing dataset, to make informed decisions on the best way to process video datasets. If pre-existing frameworks produce biased estimates, we encourage researchers to adopt a hybrid approach of first using machine learning frameworks to preprocess videos, and then to do manual annotation to save annotation time. As open-source machine learning tools are becoming more accessible, we encourage biologists to make use of these tools to cut annotation time but still get equally accurate results without the need to develop novel algorithms from scratch.

Abstract Image

比较人工、机器学习和混合视频注释方法以提取父母照料数据
测量野生动物的亲代抚育行为是动物生态学和进化研究的核心,但这往往是一项耗时费力的工作。最近出现了高效的开源工具,可以使用机器学习和计算机视觉技术从视频中量化动物行为,但与传统方法相比,这些工具的表现评估有限。为了深入了解不同的方法如何从野外拍摄的视频中提取数据,我们比较了野生家雀Passer domesticus从视频记录中估计的亲代供给率。我们比较了四种方法:专家手动标注、众包、基于开源软件DeepMeerkat的自动检测和混合标注方法。我们发现自动方法收集的数据与专家注释相关(r = 0.62),并进一步表明这些数据在预测育雏存活率方面具有生物学意义。然而,由于检测到非访问事件,自动方法产生了很大的偏差估计,而众包和混合注释产生的估计相当于专家注释。与手动标注相比,混合标注方法的标注时间约为20%,是一种更具成本效益的视频数据采集方法。我们提供了一个成功的案例研究,说明如何采用不同的方法并使用预先存在的数据集进行评估,从而就处理视频数据集的最佳方式做出明智的决策。如果预先存在的框架产生有偏差的估计,我们鼓励研究人员采用混合方法,首先使用机器学习框架对视频进行预处理,然后进行手动注释以节省注释时间。随着开源机器学习工具变得越来越容易获得,我们鼓励生物学家利用这些工具来减少注释时间,但仍然可以获得同样准确的结果,而无需从头开始开发新的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Avian Biology
Journal of Avian Biology 生物-鸟类学
CiteScore
3.70
自引率
0.00%
发文量
56
审稿时长
3 months
期刊介绍: Journal of Avian Biology publishes empirical and theoretical research in all areas of ornithology, with an emphasis on behavioural ecology, evolution and conservation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信