Jayanta Kumar Basak, Bhola Paudel, Nibas Chandra Deb, Dae Yeong Kang, Mohammed Abdus Salam, Sanjay Saha Sonet, Hyeon Tae Kim
{"title":"利用生物物理数据预测猪舍甲烷浓度的机器学习模型应用","authors":"Jayanta Kumar Basak, Bhola Paudel, Nibas Chandra Deb, Dae Yeong Kang, Mohammed Abdus Salam, Sanjay Saha Sonet, Hyeon Tae Kim","doi":"10.1007/s11270-025-07954-8","DOIUrl":null,"url":null,"abstract":"<div><p>The prediction of methane (CH<sub>4</sub>) concentration is important for pig farming due to its environmental impact on pigs and farm workers. This study examined the utilization of machine learning algorithms, specifically multiple linear regression (MLR), XGBoost regression (XGB), and random forest regression (RFR), to predict CH<sub>4</sub> concentrations in pig barns during the growing-finishing stage of pigs. The dataset included five key input biophysical variables: feed intake (FI), pig mass (MP), carbon dioxide (CO<sub>2</sub>) levels, temperature (T), and relative humidity (RH). Data was collected from three experimental pig barns during 2022 and 2023 to train and test the machine learning models. Among the three machine learning models, the RFR consistently outperformed both MLR and XGB in predicting CH<sub>4</sub> concentrations. The results demonstrated better performance by the RFR model in testing (R<sup>2</sup> > 0.81), with improvements in R<sup>2</sup> of up to 1.92% and 10.46%, as well as decreases in RMSE of up to 5.74% and 20.51%, compared to the XGB and MLR across the three input datasets. In terms of stability, MLR exhibited the maximum stability, followed by RFR and XGB. Sensitivity analysis found FI to be the most influential input variable for CH<sub>4</sub> concentration prediction, with the impact ranking being FI > MP > CO<sub>2</sub> > T > RH. This study emphasized the potential of machine learning models, particularly RFR, in predicting CH<sub>4</sub> concentrations using relevant input variables. These findings enhance understanding of CH<sub>4</sub> concentration, providing useful insights into pig production and environmental management.</p></div>","PeriodicalId":808,"journal":{"name":"Water, Air, & Soil Pollution","volume":"236 5","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Machine Learning Models for Predicting Methane Concentrations in Pig Barns Using Biophysical Data\",\"authors\":\"Jayanta Kumar Basak, Bhola Paudel, Nibas Chandra Deb, Dae Yeong Kang, Mohammed Abdus Salam, Sanjay Saha Sonet, Hyeon Tae Kim\",\"doi\":\"10.1007/s11270-025-07954-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The prediction of methane (CH<sub>4</sub>) concentration is important for pig farming due to its environmental impact on pigs and farm workers. This study examined the utilization of machine learning algorithms, specifically multiple linear regression (MLR), XGBoost regression (XGB), and random forest regression (RFR), to predict CH<sub>4</sub> concentrations in pig barns during the growing-finishing stage of pigs. The dataset included five key input biophysical variables: feed intake (FI), pig mass (MP), carbon dioxide (CO<sub>2</sub>) levels, temperature (T), and relative humidity (RH). Data was collected from three experimental pig barns during 2022 and 2023 to train and test the machine learning models. Among the three machine learning models, the RFR consistently outperformed both MLR and XGB in predicting CH<sub>4</sub> concentrations. The results demonstrated better performance by the RFR model in testing (R<sup>2</sup> > 0.81), with improvements in R<sup>2</sup> of up to 1.92% and 10.46%, as well as decreases in RMSE of up to 5.74% and 20.51%, compared to the XGB and MLR across the three input datasets. In terms of stability, MLR exhibited the maximum stability, followed by RFR and XGB. Sensitivity analysis found FI to be the most influential input variable for CH<sub>4</sub> concentration prediction, with the impact ranking being FI > MP > CO<sub>2</sub> > T > RH. This study emphasized the potential of machine learning models, particularly RFR, in predicting CH<sub>4</sub> concentrations using relevant input variables. These findings enhance understanding of CH<sub>4</sub> concentration, providing useful insights into pig production and environmental management.</p></div>\",\"PeriodicalId\":808,\"journal\":{\"name\":\"Water, Air, & Soil Pollution\",\"volume\":\"236 5\",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water, Air, & Soil Pollution\",\"FirstCategoryId\":\"6\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11270-025-07954-8\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water, Air, & Soil Pollution","FirstCategoryId":"6","ListUrlMain":"https://link.springer.com/article/10.1007/s11270-025-07954-8","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
由于甲烷(CH4)浓度对生猪和农场工人的环境影响,因此预测其浓度对养猪业很重要。本研究利用机器学习算法,特别是多元线性回归(MLR)、XGBoost回归(XGB)和随机森林回归(RFR)来预测猪生长-肥育阶段猪舍内CH4浓度。该数据集包括5个关键输入生物物理变量:采食量(FI)、猪体质量(MP)、二氧化碳(CO2)水平、温度(T)和相对湿度(RH)。在2022年和2023年期间,从三个实验猪舍收集数据,以训练和测试机器学习模型。在三种机器学习模型中,RFR在预测CH4浓度方面始终优于MLR和XGB。结果表明,RFR模型在测试(R2 >;0.81),与三个输入数据集的XGB和MLR相比,R2提高了1.92%和10.46%,RMSE降低了5.74%和20.51%。稳定性方面,MLR表现出最大的稳定性,其次是RFR和XGB。敏感性分析发现,FI是影响CH4浓度预测的最大输入变量,影响排序为FI >; MP > CO2 >;T比;RH。本研究强调了机器学习模型,特别是RFR,在使用相关输入变量预测CH4浓度方面的潜力。这些发现增强了对CH4浓度的理解,为养猪生产和环境管理提供了有用的见解。
Application of Machine Learning Models for Predicting Methane Concentrations in Pig Barns Using Biophysical Data
The prediction of methane (CH4) concentration is important for pig farming due to its environmental impact on pigs and farm workers. This study examined the utilization of machine learning algorithms, specifically multiple linear regression (MLR), XGBoost regression (XGB), and random forest regression (RFR), to predict CH4 concentrations in pig barns during the growing-finishing stage of pigs. The dataset included five key input biophysical variables: feed intake (FI), pig mass (MP), carbon dioxide (CO2) levels, temperature (T), and relative humidity (RH). Data was collected from three experimental pig barns during 2022 and 2023 to train and test the machine learning models. Among the three machine learning models, the RFR consistently outperformed both MLR and XGB in predicting CH4 concentrations. The results demonstrated better performance by the RFR model in testing (R2 > 0.81), with improvements in R2 of up to 1.92% and 10.46%, as well as decreases in RMSE of up to 5.74% and 20.51%, compared to the XGB and MLR across the three input datasets. In terms of stability, MLR exhibited the maximum stability, followed by RFR and XGB. Sensitivity analysis found FI to be the most influential input variable for CH4 concentration prediction, with the impact ranking being FI > MP > CO2 > T > RH. This study emphasized the potential of machine learning models, particularly RFR, in predicting CH4 concentrations using relevant input variables. These findings enhance understanding of CH4 concentration, providing useful insights into pig production and environmental management.
期刊介绍:
Water, Air, & Soil Pollution is an international, interdisciplinary journal on all aspects of pollution and solutions to pollution in the biosphere. This includes chemical, physical and biological processes affecting flora, fauna, water, air and soil in relation to environmental pollution. Because of its scope, the subject areas are diverse and include all aspects of pollution sources, transport, deposition, accumulation, acid precipitation, atmospheric pollution, metals, aquatic pollution including marine pollution and ground water, waste water, pesticides, soil pollution, sewage, sediment pollution, forestry pollution, effects of pollutants on humans, vegetation, fish, aquatic species, micro-organisms, and animals, environmental and molecular toxicology applied to pollution research, biosensors, global and climate change, ecological implications of pollution and pollution models. Water, Air, & Soil Pollution also publishes manuscripts on novel methods used in the study of environmental pollutants, environmental toxicology, environmental biology, novel environmental engineering related to pollution, biodiversity as influenced by pollution, novel environmental biotechnology as applied to pollution (e.g. bioremediation), environmental modelling and biorestoration of polluted environments.
Articles should not be submitted that are of local interest only and do not advance international knowledge in environmental pollution and solutions to pollution. Articles that simply replicate known knowledge or techniques while researching a local pollution problem will normally be rejected without review. Submitted articles must have up-to-date references, employ the correct experimental replication and statistical analysis, where needed and contain a significant contribution to new knowledge. The publishing and editorial team sincerely appreciate your cooperation.
Water, Air, & Soil Pollution publishes research papers; review articles; mini-reviews; and book reviews.