{"title":"基于遗传规划的螃蟹体重预测模型","authors":"Tao Shi , Lingcheng Meng , Limiao Deng , Juan Li","doi":"10.1016/j.ecoinf.2025.103131","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable weight prediction plays an important role in commercial transactions and population management of crabs. Existing research usually used predefined models to explain the relationship between the weight and the length of crabs. In this paper, we propose an effective regression method using genetic programming (GP) to build explainable models, which include more features to explore potential relationships between the weight and the physical features of crabs. The GP-based method has been evaluated on a publicly available dataset of crabs. The experimental results were compared with several baseline methods for predicting two kinds of crab weights. GP shows the best performance among all the baseline methods on the test set, i.e., 90.8% for predicting the weight of crabs and 81.3% for predicting the shucked weight of crabs in terms of coefficient of determination. Thanks to the explicit ability of feature selection, GP can select more important features to improve the prediction performance. More importantly, the generated models can provide potential interpretability, which is particularly valuable for domain experts in fisheries management and ecological research.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"88 ","pages":"Article 103131"},"PeriodicalIF":5.8000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Explainable models for predicting crab weight based on genetic programming\",\"authors\":\"Tao Shi , Lingcheng Meng , Limiao Deng , Juan Li\",\"doi\":\"10.1016/j.ecoinf.2025.103131\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Reliable weight prediction plays an important role in commercial transactions and population management of crabs. Existing research usually used predefined models to explain the relationship between the weight and the length of crabs. In this paper, we propose an effective regression method using genetic programming (GP) to build explainable models, which include more features to explore potential relationships between the weight and the physical features of crabs. The GP-based method has been evaluated on a publicly available dataset of crabs. The experimental results were compared with several baseline methods for predicting two kinds of crab weights. GP shows the best performance among all the baseline methods on the test set, i.e., 90.8% for predicting the weight of crabs and 81.3% for predicting the shucked weight of crabs in terms of coefficient of determination. Thanks to the explicit ability of feature selection, GP can select more important features to improve the prediction performance. More importantly, the generated models can provide potential interpretability, which is particularly valuable for domain experts in fisheries management and ecological research.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"88 \",\"pages\":\"Article 103131\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125001402\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001402","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Explainable models for predicting crab weight based on genetic programming
Reliable weight prediction plays an important role in commercial transactions and population management of crabs. Existing research usually used predefined models to explain the relationship between the weight and the length of crabs. In this paper, we propose an effective regression method using genetic programming (GP) to build explainable models, which include more features to explore potential relationships between the weight and the physical features of crabs. The GP-based method has been evaluated on a publicly available dataset of crabs. The experimental results were compared with several baseline methods for predicting two kinds of crab weights. GP shows the best performance among all the baseline methods on the test set, i.e., 90.8% for predicting the weight of crabs and 81.3% for predicting the shucked weight of crabs in terms of coefficient of determination. Thanks to the explicit ability of feature selection, GP can select more important features to improve the prediction performance. More importantly, the generated models can provide potential interpretability, which is particularly valuable for domain experts in fisheries management and ecological research.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.