Optimizing XGBoost Performance for Fish Weight Prediction through Parameter Pre-Selection
Mahdi Hamzaoui, Mohamed Ould-Elhassen Aoueileyine, Lamia Romdhani, Ridha Bouallegue
Fishes, published 2023-10-10. DOI: https://doi.org/10.3390/fishes8100505
Fish play a major role in human nutrition, and fish farmers need accurate predictions of fish weight to optimize production and reduce costs. However, existing prediction methods are inefficient: the formulas for calculating fish weight are generally designed for a single species or for species of similar shape. This paper proposes a new hybrid method, SFI-XGBoost, which combines the variance inflation factor (VIF), Pearson's correlation coefficient (PCC), and XGBoost, and which covers different fish species. With GridSearchCV validation, normalization, augmentation, and encoding applied, the results show that SFI-XGBoost is more efficient than plain XGBoost. The resulting model generalizes better, giving accurate results across a wide variety of species. Measured with the r2_score evaluation metric, SFI-XGBoost achieves a score of 99.94%.
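The abstract does not spell out how VIF and PCC are combined before XGBoost is trained. As a rough illustration only, the sketch below assumes they act as a feature pre-selection filter: features with a high VIF are dropped one at a time, and the remaining features are kept only if their absolute PCC with the target is large enough. The thresholds (`vif_max=10.0`, `pcc_min=0.3`), the feature names, and the synthetic data are all assumptions made for this example, not values from the paper. The code is dependency-free so that the two statistics are visible; it computes each VIF as the corresponding diagonal entry of the inverse feature correlation matrix, which equals 1 / (1 - R²) for that feature.

```python
import math
import random


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def invert(m):
    """Invert a small square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(m)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def vif(columns):
    """VIF per feature: the diagonal of the inverse feature correlation matrix."""
    names = list(columns)
    if len(names) == 1:
        return {names[0]: 1.0}
    corr = [[pearson(columns[a], columns[b]) for b in names] for a in names]
    inv = invert(corr)
    return {name: inv[i][i] for i, name in enumerate(names)}


def preselect(columns, target, vif_max=10.0, pcc_min=0.3):
    """Assumed rule (not from the paper): drop the highest-VIF feature until
    every VIF is <= vif_max, then keep features with |PCC to target| >= pcc_min."""
    kept = dict(columns)
    while len(kept) > 1:
        scores = vif(kept)
        worst = max(scores, key=scores.get)
        if scores[worst] <= vif_max:
            break
        del kept[worst]
    return [name for name in kept if abs(pearson(kept[name], target)) >= pcc_min]


def make_demo_data(n=300, seed=0):
    """Synthetic, hypothetical measurements; 'diagonal' nearly duplicates 'length'."""
    rng = random.Random(seed)
    length = [rng.uniform(10, 40) for _ in range(n)]
    width = [rng.uniform(2, 8) for _ in range(n)]
    height = [rng.uniform(3, 12) for _ in range(n)]
    diagonal = [1.08 * v + rng.gauss(0, 0.1) for v in length]
    noise = [rng.uniform(0, 1) for _ in range(n)]  # unrelated to the target
    weight = [0.6 * l + 3 * w + 2 * h + rng.gauss(0, 1)
              for l, w, h in zip(length, width, height)]
    columns = {"length": length, "width": width, "height": height,
               "diagonal": diagonal, "noise": noise}
    return columns, weight


columns, weight = make_demo_data()
selected = preselect(columns, weight)
print(selected)
```

On this synthetic data the filter removes one of the two nearly collinear length measurements and the unrelated `noise` column. In a full pipeline along the lines the abstract describes, the surviving columns would then be normalized, encoded, and passed to an XGBoost regressor tuned with GridSearchCV; that step is left out here to keep the sketch free of external dependencies.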