Interpretability and identification of dimorphism in morphological indexes of Larimichthys crocea based on machine learning models

IF 2.3 2区 农林科学 Q2 FISHERIES
Liguo Ou , Linlin Lu , Weiguo Qian , Bilin Liu
{"title":"Interpretability and identification of dimorphism in morphological indexes of Larimichthys crocea based on machine learning models","authors":"Liguo Ou ,&nbsp;Linlin Lu ,&nbsp;Weiguo Qian ,&nbsp;Bilin Liu","doi":"10.1016/j.fishres.2025.107475","DOIUrl":null,"url":null,"abstract":"<div><div>The morphological indexes serve as a critical biological foundation for analyzing species dimorphism, play a pivotal role in population dynamics models and species assessments, and provide valuable, accurate, and cost-efficient biological information. Dimorphism identification holds significant importance for the conservation and sustainable development of <em>Larimichthys crocea</em> resources. Therefore, this study aims to validate the dimorphism effects of various morphological indexes using interpretable machine learning techniques and evaluate model performance and deviation in automatic identification. First, data visualization, significance analysis, correlation analysis, and principal component analysis (PCA) were applied to otolith morphology (OM) indexes and fish body morphology (FM) indexes. Then, the SHAP (SHapley Additive exPlanations) method of machine learning was used to analyze the importance of different morphological indexes and output the morphological indexes of importance. Finally, different machine learning models were used to analyze the identification performance and deviation of <em>Larimichthys crocea</em> dimorphism. The experimental results demonstrate that the SHAP method effectively prioritizes the importance of different morphological indexes, with the importance of OM indexes primarily concentrated in the sulcus. Within the machine learning models, OM indexes achieved a peak identification rate of 71 % (Random Forest), whereas FM indexes reached a maximum identification rate of 65 % (Random Forest and Support Vector Machine). The comparative analysis of the average effects of different models, including evaluation metrics and learning curves, demonstrates that OM indexes outperform FM indexes in terms of identification performance. The application of machine learning models not only enables a comprehensive analysis of the dimorphism in <em>Larimichthys crocea</em> but also offers effective strategies for the conservation of <em>Larimichthys crocea</em> resources and their associated biodiversity.</div></div>","PeriodicalId":50443,"journal":{"name":"Fisheries Research","volume":"288 ","pages":"Article 107475"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fisheries Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165783625002127","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FISHERIES","Score":null,"Total":0}
引用次数: 0

Abstract

The morphological indexes serve as a critical biological foundation for analyzing species dimorphism, play a pivotal role in population dynamics models and species assessments, and provide valuable, accurate, and cost-efficient biological information. Dimorphism identification holds significant importance for the conservation and sustainable development of Larimichthys crocea resources. Therefore, this study aims to validate the dimorphism effects of various morphological indexes using interpretable machine learning techniques and evaluate model performance and deviation in automatic identification. First, data visualization, significance analysis, correlation analysis, and principal component analysis (PCA) were applied to otolith morphology (OM) indexes and fish body morphology (FM) indexes. Then, the SHAP (SHapley Additive exPlanations) method of machine learning was used to analyze the importance of different morphological indexes and output the morphological indexes of importance. Finally, different machine learning models were used to analyze the identification performance and deviation of Larimichthys crocea dimorphism. The experimental results demonstrate that the SHAP method effectively prioritizes the importance of different morphological indexes, with the importance of OM indexes primarily concentrated in the sulcus. Within the machine learning models, OM indexes achieved a peak identification rate of 71 % (Random Forest), whereas FM indexes reached a maximum identification rate of 65 % (Random Forest and Support Vector Machine). The comparative analysis of the average effects of different models, including evaluation metrics and learning curves, demonstrates that OM indexes outperform FM indexes in terms of identification performance. The application of machine learning models not only enables a comprehensive analysis of the dimorphism in Larimichthys crocea but also offers effective strategies for the conservation of Larimichthys crocea resources and their associated biodiversity.
基于机器学习模型的大菱鲆形态指标二态性的可解释性与识别
形态学指标是分析物种二态性的重要生物学基础,在种群动态模型和物种评价中发挥着关键作用,提供了有价值、准确、经济的生物学信息。二态性鉴定对鳜鱼资源的保护和可持续发展具有重要意义。因此,本研究旨在利用可解释机器学习技术验证各种形态指标的二态效应,并评估模型在自动识别中的性能和偏差。首先,对耳石形态(OM)指标和鱼体形态(FM)指标进行数据可视化、显著性分析、相关分析和主成分分析(PCA)。然后,利用机器学习的SHAP (SHapley Additive exPlanations)方法对不同形态指标的重要性进行分析,输出重要性形态指标。最后,利用不同的机器学习模型对大鲵二态性的识别性能和偏差进行了分析。实验结果表明,SHAP方法对不同形态指标的重要性进行了有效的优先排序,其中OM指标的重要性主要集中在沟区。在机器学习模型中,OM指标的峰值识别率为71 %(随机森林),而FM指标的最高识别率为65 %(随机森林和支持向量机)。通过对不同模型(包括评价指标和学习曲线)的平均效果进行对比分析,发现OM指标在识别性能上优于FM指标。机器学习模型的应用不仅可以全面分析大鲵的二态性,而且可以为大鲵资源及其相关生物多样性的保护提供有效的策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Fisheries Research
Fisheries Research 农林科学-渔业
CiteScore
4.50
自引率
16.70%
发文量
294
审稿时长
15 weeks
期刊介绍: This journal provides an international forum for the publication of papers in the areas of fisheries science, fishing technology, fisheries management and relevant socio-economics. The scope covers fisheries in salt, brackish and freshwater systems, and all aspects of associated ecology, environmental aspects of fisheries, and economics. Both theoretical and practical papers are acceptable, including laboratory and field experimental studies relevant to fisheries. Papers on the conservation of exploitable living resources are welcome. Review and Viewpoint articles are also published. As the specified areas inevitably impinge on and interrelate with each other, the approach of the journal is multidisciplinary, and authors are encouraged to emphasise the relevance of their own work to that of other disciplines. The journal is intended for fisheries scientists, biological oceanographers, gear technologists, economists, managers, administrators, policy makers and legislators.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信