Mina Shumaly , Yunsoo Park , Saif Agha , Santosh Pandey , Juan Steibel
{"title":"基于计算机视觉的动物表型分析存在不确定的识别","authors":"Mina Shumaly , Yunsoo Park , Saif Agha , Santosh Pandey , Juan Steibel","doi":"10.1016/j.compag.2025.110560","DOIUrl":null,"url":null,"abstract":"<div><div>Animal identification (ID) is key for implementing precision livestock farming technologies. Animal ID algorithms typically generate a probability vector representing the likelihood of each potential individual. Conventionally, the individual with the highest probability is selected as the putative ID. However, this practice may reduce the precision of subsequent downstream analysis by disregarding the inherent uncertainty in the probability distribution. In this study, a mixture model is proposed to incorporate the uncertainty of the ID assignment into downstream analysis, aiming to investigate the impact of ignoring/incorporating the uncertainty of assignment in the subsequent estimations.</div><div>We applied our method on two datasets: 1) a publicly available dataset of 3226 images from 30 thoroughbred horses, classified based on body morphometrics using Linear Discriminant Analysis (LDA) with an accuracy of 88 %, where we simulated independent phenotypes with varying group effect sizes and variances, and 2) a dataset comprising 1770 images from 59 Holstein cattle, classified using Support Vector Machines (SVM) with an accuracy of 95 %, where phenotypes were extracted as measurements of body area from each image.</div><div>We analyzed the phenotypic data for both datasets to estimate group means and variance components using three approaches: 1) using the correct IDs, 2) using the top probability assignments, and 3) incorporating ID uncertainty through mixture models. Our results indicate that incorporating the mixture model improved the accuracy of variance component estimation and significantly increased the confidence of ID predictions across both datasets.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110560"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computer vision-based animal phenotyping and analysis in presence of uncertain identification\",\"authors\":\"Mina Shumaly , Yunsoo Park , Saif Agha , Santosh Pandey , Juan Steibel\",\"doi\":\"10.1016/j.compag.2025.110560\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Animal identification (ID) is key for implementing precision livestock farming technologies. Animal ID algorithms typically generate a probability vector representing the likelihood of each potential individual. Conventionally, the individual with the highest probability is selected as the putative ID. However, this practice may reduce the precision of subsequent downstream analysis by disregarding the inherent uncertainty in the probability distribution. In this study, a mixture model is proposed to incorporate the uncertainty of the ID assignment into downstream analysis, aiming to investigate the impact of ignoring/incorporating the uncertainty of assignment in the subsequent estimations.</div><div>We applied our method on two datasets: 1) a publicly available dataset of 3226 images from 30 thoroughbred horses, classified based on body morphometrics using Linear Discriminant Analysis (LDA) with an accuracy of 88 %, where we simulated independent phenotypes with varying group effect sizes and variances, and 2) a dataset comprising 1770 images from 59 Holstein cattle, classified using Support Vector Machines (SVM) with an accuracy of 95 %, where phenotypes were extracted as measurements of body area from each image.</div><div>We analyzed the phenotypic data for both datasets to estimate group means and variance components using three approaches: 1) using the correct IDs, 2) using the top probability assignments, and 3) incorporating ID uncertainty through mixture models. Our results indicate that incorporating the mixture model improved the accuracy of variance component estimation and significantly increased the confidence of ID predictions across both datasets.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"237 \",\"pages\":\"Article 110560\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925006660\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925006660","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Computer vision-based animal phenotyping and analysis in presence of uncertain identification
Animal identification (ID) is key for implementing precision livestock farming technologies. Animal ID algorithms typically generate a probability vector representing the likelihood of each potential individual. Conventionally, the individual with the highest probability is selected as the putative ID. However, this practice may reduce the precision of subsequent downstream analysis by disregarding the inherent uncertainty in the probability distribution. In this study, a mixture model is proposed to incorporate the uncertainty of the ID assignment into downstream analysis, aiming to investigate the impact of ignoring/incorporating the uncertainty of assignment in the subsequent estimations.
We applied our method on two datasets: 1) a publicly available dataset of 3226 images from 30 thoroughbred horses, classified based on body morphometrics using Linear Discriminant Analysis (LDA) with an accuracy of 88 %, where we simulated independent phenotypes with varying group effect sizes and variances, and 2) a dataset comprising 1770 images from 59 Holstein cattle, classified using Support Vector Machines (SVM) with an accuracy of 95 %, where phenotypes were extracted as measurements of body area from each image.
We analyzed the phenotypic data for both datasets to estimate group means and variance components using three approaches: 1) using the correct IDs, 2) using the top probability assignments, and 3) incorporating ID uncertainty through mixture models. Our results indicate that incorporating the mixture model improved the accuracy of variance component estimation and significantly increased the confidence of ID predictions across both datasets.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.