Impact of variable selection and model complexity on the prediction of water quality parameters for Penaeus vannamei aquaculture in a short dataset context

IF 4.3 2区 农林科学 Q2 AGRICULTURAL ENGINEERING
Vinícius Fellype Cavalcanti de França, Luis Otavio Brito da Silva, Humber Agrelli de Andrade
{"title":"Impact of variable selection and model complexity on the prediction of water quality parameters for Penaeus vannamei aquaculture in a short dataset context","authors":"Vinícius Fellype Cavalcanti de França,&nbsp;Luis Otavio Brito da Silva,&nbsp;Humber Agrelli de Andrade","doi":"10.1016/j.aquaeng.2025.102640","DOIUrl":null,"url":null,"abstract":"<div><div>Aquaculture is expanding rapidly worldwide, increasing the demand for efficient water quality management in shrimp farming. In this study, we evaluated the impact of variable selection and model complexity on the prediction of the mean of water parameters using machine learning. Two variable selection approaches were applied: a Granger causality test to capture temporal predictability, and a backward procedure based on the Akaike Information Criterion to balance model fit and complexity. An experimental dataset of 106 observations of temperature, dissolved oxygen, salinity and pH was standardised and modelled using a linear regression and a random forest regressor. Model performance was assessed by cross-validation using mean squared error (MSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) as metrics. Our results showed a significant superiority of linear regressor over the random forest, suggesting that simpler models may be more effective with limited datasets than more complex models.</div></div>","PeriodicalId":8120,"journal":{"name":"Aquacultural Engineering","volume":"112 ","pages":"Article 102640"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aquacultural Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0144860925001293","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

Aquaculture is expanding rapidly worldwide, increasing the demand for efficient water quality management in shrimp farming. In this study, we evaluated the impact of variable selection and model complexity on the prediction of the mean of water parameters using machine learning. Two variable selection approaches were applied: a Granger causality test to capture temporal predictability, and a backward procedure based on the Akaike Information Criterion to balance model fit and complexity. An experimental dataset of 106 observations of temperature, dissolved oxygen, salinity and pH was standardised and modelled using a linear regression and a random forest regressor. Model performance was assessed by cross-validation using mean squared error (MSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) as metrics. Our results showed a significant superiority of linear regressor over the random forest, suggesting that simpler models may be more effective with limited datasets than more complex models.
短数据环境下变量选择和模型复杂性对凡纳滨对虾养殖水质参数预测的影响
水产养殖在世界范围内迅速扩大,增加了对虾养殖对有效水质管理的需求。在这项研究中,我们评估了变量选择和模型复杂性对使用机器学习预测水参数平均值的影响。采用了两种变量选择方法:格兰杰因果检验来捕获时间可预测性,以及基于赤池信息准则的反向过程来平衡模型拟合和复杂性。对106个温度、溶解氧、盐度和pH值观测数据的实验数据集进行了标准化,并使用线性回归和随机森林回归模型进行了建模。以均方误差(MSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)为指标,通过交叉验证评估模型性能。我们的研究结果显示线性回归器比随机森林具有显著的优越性,这表明在有限的数据集上,简单的模型可能比复杂的模型更有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Aquacultural Engineering
Aquacultural Engineering 农林科学-农业工程
CiteScore
8.60
自引率
10.00%
发文量
63
审稿时长
>24 weeks
期刊介绍: Aquacultural Engineering is concerned with the design and development of effective aquacultural systems for marine and freshwater facilities. The journal aims to apply the knowledge gained from basic research which potentially can be translated into commercial operations. Problems of scale-up and application of research data involve many parameters, both physical and biological, making it difficult to anticipate the interaction between the unit processes and the cultured animals. Aquacultural Engineering aims to develop this bioengineering interface for aquaculture and welcomes contributions in the following areas: – Engineering and design of aquaculture facilities – Engineering-based research studies – Construction experience and techniques – In-service experience, commissioning, operation – Materials selection and their uses – Quantification of biological data and constraints
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信