基于隐马尔可夫模型多源数据的发情奶牛多模态融合检测方法

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Jun Wang , Yijia Zhao , Xiaoxia Li , Yu Zhou , Kaixuan Zhao , Hui Wang , Waleid Mohamed EL-Sayed Shakweer
{"title":"基于隐马尔可夫模型多源数据的发情奶牛多模态融合检测方法","authors":"Jun Wang ,&nbsp;Yijia Zhao ,&nbsp;Xiaoxia Li ,&nbsp;Yu Zhou ,&nbsp;Kaixuan Zhao ,&nbsp;Hui Wang ,&nbsp;Waleid Mohamed EL-Sayed Shakweer","doi":"10.1016/j.compag.2025.110391","DOIUrl":null,"url":null,"abstract":"<div><div>The combination of multimodal features based on vocalization and behavioral traits has been proven to enhance the robustness and accuracy of estrus detection. However, challenges still need to be addressed to improve the reliability and practicality of the identification of estrus cows, including unclear association mechanisms between multi-feature estrus traits and complex estrus states, inadequate feature selection and fusion strategies, and limitations in algorithm performance. To cope with these difficulties, the Friedman test, Mantel test, Spearman rank correlation coefficient, Kruskal-Wallis test, and Canonical Correspondence Analysis (CCA) were employed to explore the complex interactions among high-dimensional estrus data. Moreover, the model interpretability framework based on the self-attention mechanism was constructed to reveal the importance of critical features and optimize feature combinations. In addition, the multimodal fusion approach integrating standardization processing, principal component analysis, Fourier transform, statistical indicators extraction, and semi-tensor product was designed to elevate the depth of representation and the fusion effect of multi-dimensional data by comprehensively analyzing the relationships among various features. Furthermore, a neural network-optimized Hidden Markov Model (NN-HMM) for estrus detection was proposed to promote the capability of estrus detection by overcoming the imperfections of traditional HMM in capturing long distance dependence effect in state transition matrices, describing complex feature relationship in emission probability matrices, and dynamic adaptability of optimal path generation. The experimental results demonstrated that the selection of optimal feature combination integrating acoustic and behavioral features (number of bellowings, number of lowings, number of consecutive bellowings, vocalization frequency, standing duration, lying duration, walking duration, feeding duration, activity index, number of standing mounts, number of social behaviors, and ruminating variation index) improved estrus detection accuracy by over 18 % compared to using suboptimal feature combinations. Meanwhile, compared with Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) models, the accuracy of the developed multimodal fusion-based detection method increased by 8.8 %, 7.4 %, and 6.1 %, while the precision enhanced by 7.4 %, 6.0 %, and 4.3 %, respectively. Blind testing conducted on multisource estrus data from 24 multiparous and 16 primiparous cows found that the proposed prediction method advanced the prediction of estrus onset by up to 85 and 40 mins for primiparous cows and up to 75 and 60 mins for multiparous cows, respectively, significantly surpassing conventional activity index and acoustic-based methods. Therefore, the method proposed in this study undoubtedly provides a reliable solution for timely and efficient detection of estrus in dairy farms.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"235 ","pages":"Article 110391"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multimodal fusion-based detection method of estrus cows using multisource data inspired by hidden Markov model algorithms\",\"authors\":\"Jun Wang ,&nbsp;Yijia Zhao ,&nbsp;Xiaoxia Li ,&nbsp;Yu Zhou ,&nbsp;Kaixuan Zhao ,&nbsp;Hui Wang ,&nbsp;Waleid Mohamed EL-Sayed Shakweer\",\"doi\":\"10.1016/j.compag.2025.110391\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The combination of multimodal features based on vocalization and behavioral traits has been proven to enhance the robustness and accuracy of estrus detection. However, challenges still need to be addressed to improve the reliability and practicality of the identification of estrus cows, including unclear association mechanisms between multi-feature estrus traits and complex estrus states, inadequate feature selection and fusion strategies, and limitations in algorithm performance. To cope with these difficulties, the Friedman test, Mantel test, Spearman rank correlation coefficient, Kruskal-Wallis test, and Canonical Correspondence Analysis (CCA) were employed to explore the complex interactions among high-dimensional estrus data. Moreover, the model interpretability framework based on the self-attention mechanism was constructed to reveal the importance of critical features and optimize feature combinations. In addition, the multimodal fusion approach integrating standardization processing, principal component analysis, Fourier transform, statistical indicators extraction, and semi-tensor product was designed to elevate the depth of representation and the fusion effect of multi-dimensional data by comprehensively analyzing the relationships among various features. Furthermore, a neural network-optimized Hidden Markov Model (NN-HMM) for estrus detection was proposed to promote the capability of estrus detection by overcoming the imperfections of traditional HMM in capturing long distance dependence effect in state transition matrices, describing complex feature relationship in emission probability matrices, and dynamic adaptability of optimal path generation. The experimental results demonstrated that the selection of optimal feature combination integrating acoustic and behavioral features (number of bellowings, number of lowings, number of consecutive bellowings, vocalization frequency, standing duration, lying duration, walking duration, feeding duration, activity index, number of standing mounts, number of social behaviors, and ruminating variation index) improved estrus detection accuracy by over 18 % compared to using suboptimal feature combinations. Meanwhile, compared with Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) models, the accuracy of the developed multimodal fusion-based detection method increased by 8.8 %, 7.4 %, and 6.1 %, while the precision enhanced by 7.4 %, 6.0 %, and 4.3 %, respectively. Blind testing conducted on multisource estrus data from 24 multiparous and 16 primiparous cows found that the proposed prediction method advanced the prediction of estrus onset by up to 85 and 40 mins for primiparous cows and up to 75 and 60 mins for multiparous cows, respectively, significantly surpassing conventional activity index and acoustic-based methods. Therefore, the method proposed in this study undoubtedly provides a reliable solution for timely and efficient detection of estrus in dairy farms.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"235 \",\"pages\":\"Article 110391\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925004971\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925004971","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

基于发声特征和行为特征的多模态特征相结合已被证明可以提高发情检测的鲁棒性和准确性。然而,多特征发情特征与复杂发情状态之间的关联机制不明确、特征选择和融合策略不完善、算法性能存在局限性等问题仍需解决,以提高发情奶牛识别的可靠性和实用性。针对这些困难,采用Friedman检验、Mantel检验、Spearman秩相关系数检验、Kruskal-Wallis检验和标准对应分析(Canonical Correspondence Analysis, CCA)来探讨高维发情数据之间复杂的相互作用。构建了基于自注意机制的模型可解释性框架,揭示关键特征的重要性,优化特征组合。此外,设计了集标准化处理、主成分分析、傅立叶变换、统计指标提取、半张量积为一体的多模态融合方法,通过综合分析各种特征之间的关系,提高多维数据的表示深度和融合效果。在此基础上,提出了一种神经网络优化的隐马尔可夫模型(NN-HMM),克服了传统隐马尔可夫模型在捕捉状态转移矩阵中的长距离依赖效应、描述发射概率矩阵中的复杂特征关系以及最优路径生成的动态适应性等方面的缺陷,提高了发情检测的能力。实验结果表明,结合声学和行为特征(鸣叫次数、鸣叫次数、连续鸣叫次数、发声频率、站立时长、躺卧时长、行走时长、进食时长、活动指数、站立坐垫次数、社会行为次数、与使用次优特征组合相比,反刍变异指数(反刍变异指数)将发情检测准确率提高了18%以上。与支持向量机(SVM)、卷积神经网络(CNN)和长短期记忆(LSTM)模型相比,所开发的多模态融合检测方法的准确率分别提高了8.8%、7.4%和6.1%,精度分别提高了7.4%、6.0%和4.3%。通过对24头多产奶牛和16头初产奶牛的多源发情数据进行盲测,发现该预测方法对初产奶牛的发情时间预测分别提前85分钟和40分钟,对多产奶牛的发情时间预测分别提前75分钟和60分钟,明显优于传统的活动指数和基于声学的方法。因此,本研究提出的方法无疑为奶牛场及时、高效地检测发情提供了可靠的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multimodal fusion-based detection method of estrus cows using multisource data inspired by hidden Markov model algorithms
The combination of multimodal features based on vocalization and behavioral traits has been proven to enhance the robustness and accuracy of estrus detection. However, challenges still need to be addressed to improve the reliability and practicality of the identification of estrus cows, including unclear association mechanisms between multi-feature estrus traits and complex estrus states, inadequate feature selection and fusion strategies, and limitations in algorithm performance. To cope with these difficulties, the Friedman test, Mantel test, Spearman rank correlation coefficient, Kruskal-Wallis test, and Canonical Correspondence Analysis (CCA) were employed to explore the complex interactions among high-dimensional estrus data. Moreover, the model interpretability framework based on the self-attention mechanism was constructed to reveal the importance of critical features and optimize feature combinations. In addition, the multimodal fusion approach integrating standardization processing, principal component analysis, Fourier transform, statistical indicators extraction, and semi-tensor product was designed to elevate the depth of representation and the fusion effect of multi-dimensional data by comprehensively analyzing the relationships among various features. Furthermore, a neural network-optimized Hidden Markov Model (NN-HMM) for estrus detection was proposed to promote the capability of estrus detection by overcoming the imperfections of traditional HMM in capturing long distance dependence effect in state transition matrices, describing complex feature relationship in emission probability matrices, and dynamic adaptability of optimal path generation. The experimental results demonstrated that the selection of optimal feature combination integrating acoustic and behavioral features (number of bellowings, number of lowings, number of consecutive bellowings, vocalization frequency, standing duration, lying duration, walking duration, feeding duration, activity index, number of standing mounts, number of social behaviors, and ruminating variation index) improved estrus detection accuracy by over 18 % compared to using suboptimal feature combinations. Meanwhile, compared with Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Long Short-Term Memory (LSTM) models, the accuracy of the developed multimodal fusion-based detection method increased by 8.8 %, 7.4 %, and 6.1 %, while the precision enhanced by 7.4 %, 6.0 %, and 4.3 %, respectively. Blind testing conducted on multisource estrus data from 24 multiparous and 16 primiparous cows found that the proposed prediction method advanced the prediction of estrus onset by up to 85 and 40 mins for primiparous cows and up to 75 and 60 mins for multiparous cows, respectively, significantly surpassing conventional activity index and acoustic-based methods. Therefore, the method proposed in this study undoubtedly provides a reliable solution for timely and efficient detection of estrus in dairy farms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信