{"title":"基于机器学习模型的大菱鲆形态指标二态性的可解释性与识别","authors":"Liguo Ou , Linlin Lu , Weiguo Qian , Bilin Liu","doi":"10.1016/j.fishres.2025.107475","DOIUrl":null,"url":null,"abstract":"<div><div>The morphological indexes serve as a critical biological foundation for analyzing species dimorphism, play a pivotal role in population dynamics models and species assessments, and provide valuable, accurate, and cost-efficient biological information. Dimorphism identification holds significant importance for the conservation and sustainable development of <em>Larimichthys crocea</em> resources. Therefore, this study aims to validate the dimorphism effects of various morphological indexes using interpretable machine learning techniques and evaluate model performance and deviation in automatic identification. First, data visualization, significance analysis, correlation analysis, and principal component analysis (PCA) were applied to otolith morphology (OM) indexes and fish body morphology (FM) indexes. Then, the SHAP (SHapley Additive exPlanations) method of machine learning was used to analyze the importance of different morphological indexes and output the morphological indexes of importance. Finally, different machine learning models were used to analyze the identification performance and deviation of <em>Larimichthys crocea</em> dimorphism. The experimental results demonstrate that the SHAP method effectively prioritizes the importance of different morphological indexes, with the importance of OM indexes primarily concentrated in the sulcus. Within the machine learning models, OM indexes achieved a peak identification rate of 71 % (Random Forest), whereas FM indexes reached a maximum identification rate of 65 % (Random Forest and Support Vector Machine). The comparative analysis of the average effects of different models, including evaluation metrics and learning curves, demonstrates that OM indexes outperform FM indexes in terms of identification performance. The application of machine learning models not only enables a comprehensive analysis of the dimorphism in <em>Larimichthys crocea</em> but also offers effective strategies for the conservation of <em>Larimichthys crocea</em> resources and their associated biodiversity.</div></div>","PeriodicalId":50443,"journal":{"name":"Fisheries Research","volume":"288 ","pages":"Article 107475"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretability and identification of dimorphism in morphological indexes of Larimichthys crocea based on machine learning models\",\"authors\":\"Liguo Ou , Linlin Lu , Weiguo Qian , Bilin Liu\",\"doi\":\"10.1016/j.fishres.2025.107475\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The morphological indexes serve as a critical biological foundation for analyzing species dimorphism, play a pivotal role in population dynamics models and species assessments, and provide valuable, accurate, and cost-efficient biological information. Dimorphism identification holds significant importance for the conservation and sustainable development of <em>Larimichthys crocea</em> resources. Therefore, this study aims to validate the dimorphism effects of various morphological indexes using interpretable machine learning techniques and evaluate model performance and deviation in automatic identification. First, data visualization, significance analysis, correlation analysis, and principal component analysis (PCA) were applied to otolith morphology (OM) indexes and fish body morphology (FM) indexes. Then, the SHAP (SHapley Additive exPlanations) method of machine learning was used to analyze the importance of different morphological indexes and output the morphological indexes of importance. Finally, different machine learning models were used to analyze the identification performance and deviation of <em>Larimichthys crocea</em> dimorphism. The experimental results demonstrate that the SHAP method effectively prioritizes the importance of different morphological indexes, with the importance of OM indexes primarily concentrated in the sulcus. Within the machine learning models, OM indexes achieved a peak identification rate of 71 % (Random Forest), whereas FM indexes reached a maximum identification rate of 65 % (Random Forest and Support Vector Machine). The comparative analysis of the average effects of different models, including evaluation metrics and learning curves, demonstrates that OM indexes outperform FM indexes in terms of identification performance. The application of machine learning models not only enables a comprehensive analysis of the dimorphism in <em>Larimichthys crocea</em> but also offers effective strategies for the conservation of <em>Larimichthys crocea</em> resources and their associated biodiversity.</div></div>\",\"PeriodicalId\":50443,\"journal\":{\"name\":\"Fisheries Research\",\"volume\":\"288 \",\"pages\":\"Article 107475\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fisheries Research\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0165783625002127\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FISHERIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fisheries Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165783625002127","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FISHERIES","Score":null,"Total":0}
Interpretability and identification of dimorphism in morphological indexes of Larimichthys crocea based on machine learning models
The morphological indexes serve as a critical biological foundation for analyzing species dimorphism, play a pivotal role in population dynamics models and species assessments, and provide valuable, accurate, and cost-efficient biological information. Dimorphism identification holds significant importance for the conservation and sustainable development of Larimichthys crocea resources. Therefore, this study aims to validate the dimorphism effects of various morphological indexes using interpretable machine learning techniques and evaluate model performance and deviation in automatic identification. First, data visualization, significance analysis, correlation analysis, and principal component analysis (PCA) were applied to otolith morphology (OM) indexes and fish body morphology (FM) indexes. Then, the SHAP (SHapley Additive exPlanations) method of machine learning was used to analyze the importance of different morphological indexes and output the morphological indexes of importance. Finally, different machine learning models were used to analyze the identification performance and deviation of Larimichthys crocea dimorphism. The experimental results demonstrate that the SHAP method effectively prioritizes the importance of different morphological indexes, with the importance of OM indexes primarily concentrated in the sulcus. Within the machine learning models, OM indexes achieved a peak identification rate of 71 % (Random Forest), whereas FM indexes reached a maximum identification rate of 65 % (Random Forest and Support Vector Machine). The comparative analysis of the average effects of different models, including evaluation metrics and learning curves, demonstrates that OM indexes outperform FM indexes in terms of identification performance. The application of machine learning models not only enables a comprehensive analysis of the dimorphism in Larimichthys crocea but also offers effective strategies for the conservation of Larimichthys crocea resources and their associated biodiversity.
期刊介绍:
This journal provides an international forum for the publication of papers in the areas of fisheries science, fishing technology, fisheries management and relevant socio-economics. The scope covers fisheries in salt, brackish and freshwater systems, and all aspects of associated ecology, environmental aspects of fisheries, and economics. Both theoretical and practical papers are acceptable, including laboratory and field experimental studies relevant to fisheries. Papers on the conservation of exploitable living resources are welcome. Review and Viewpoint articles are also published. As the specified areas inevitably impinge on and interrelate with each other, the approach of the journal is multidisciplinary, and authors are encouraged to emphasise the relevance of their own work to that of other disciplines. The journal is intended for fisheries scientists, biological oceanographers, gear technologists, economists, managers, administrators, policy makers and legislators.