利用机器学习方法对微生物生长预测的深入研究

2区 工程技术 Q1 Earth and Planetary Sciences
Amin Bemani , Alireza Kazemi , Mohammad Ahmadi
{"title":"利用机器学习方法对微生物生长预测的深入研究","authors":"Amin Bemani ,&nbsp;Alireza Kazemi ,&nbsp;Mohammad Ahmadi","doi":"10.1016/j.petrol.2022.111162","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Microbial enhanced oil recovery (MEOR) is a well-known oil recovery method that is greatly influenced by the growth and metabolism of the </span>microorganisms<span><span><span><span>. Given the complexities and uncertainties associated with identifying the growth mechanism of microorganism, developing an approach to estimate bacterial concentration versus different factors viz. </span>Salinity<span>, temperature and time is still deemed a challenge. Hence, in this study, seven different machine learning methods namely </span></span>Artificial Neural Network, </span>Support Vector Machine, Decision Tree, K-nearest Neighbors, Ensemble Learning, Random Forest and Adaptive Boosting are utilized to predict bacterial cell concentration. A databank including 110 data points of bacterial cell concentration entailing the incubation time, salinity, temperature and yeast extract has been collected and used for preparation of these models. Graphical and statistical comparisons are used to analyze the performance and accuracy of each integrated model. The retrieved results revealed that the trained ensemble learning model is the most accurate method in estimating the bacterial growth with </span></span>correlation coefficient<span> and mean squared error of 0.9163 and 0.0542 on the tested dataset, respectively. Moreover, the KNN model with correlation coefficient and mean squared error of 0.6111 and 0.1192, respectively, is the worst model among the seven estimators. This model has great accuracy in training phase while it is not accurate in validation and testing phase. Due to this fact, it can be concluded that KNN model suffers from overfitting problem. In addition, the impacts of incubation time, yeast extract, temperature and salinity on bacterial cell concentration are also ascertained using sensitivity analysis. It is discerned that the temperature and yeast extract are the most and least effective factors on growth of microorganism, respectively.</span></p></div>","PeriodicalId":16717,"journal":{"name":"Journal of Petroleum Science and Engineering","volume":"220 ","pages":"Article 111162"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An insight into the microorganism growth prediction by means of machine learning approaches\",\"authors\":\"Amin Bemani ,&nbsp;Alireza Kazemi ,&nbsp;Mohammad Ahmadi\",\"doi\":\"10.1016/j.petrol.2022.111162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span><span>Microbial enhanced oil recovery (MEOR) is a well-known oil recovery method that is greatly influenced by the growth and metabolism of the </span>microorganisms<span><span><span><span>. Given the complexities and uncertainties associated with identifying the growth mechanism of microorganism, developing an approach to estimate bacterial concentration versus different factors viz. </span>Salinity<span>, temperature and time is still deemed a challenge. Hence, in this study, seven different machine learning methods namely </span></span>Artificial Neural Network, </span>Support Vector Machine, Decision Tree, K-nearest Neighbors, Ensemble Learning, Random Forest and Adaptive Boosting are utilized to predict bacterial cell concentration. A databank including 110 data points of bacterial cell concentration entailing the incubation time, salinity, temperature and yeast extract has been collected and used for preparation of these models. Graphical and statistical comparisons are used to analyze the performance and accuracy of each integrated model. The retrieved results revealed that the trained ensemble learning model is the most accurate method in estimating the bacterial growth with </span></span>correlation coefficient<span> and mean squared error of 0.9163 and 0.0542 on the tested dataset, respectively. Moreover, the KNN model with correlation coefficient and mean squared error of 0.6111 and 0.1192, respectively, is the worst model among the seven estimators. This model has great accuracy in training phase while it is not accurate in validation and testing phase. Due to this fact, it can be concluded that KNN model suffers from overfitting problem. In addition, the impacts of incubation time, yeast extract, temperature and salinity on bacterial cell concentration are also ascertained using sensitivity analysis. It is discerned that the temperature and yeast extract are the most and least effective factors on growth of microorganism, respectively.</span></p></div>\",\"PeriodicalId\":16717,\"journal\":{\"name\":\"Journal of Petroleum Science and Engineering\",\"volume\":\"220 \",\"pages\":\"Article 111162\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Petroleum Science and Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0920410522010142\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Earth and Planetary Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Petroleum Science and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0920410522010142","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

摘要

微生物提高采油(MEOR)是一种众所周知的采油方法,它受微生物生长和代谢的影响很大。鉴于确定微生物生长机制的复杂性和不确定性,开发一种方法来估计细菌浓度与不同因素(如盐度、温度和时间)之间的关系仍然被认为是一项挑战。因此,在本研究中,利用人工神经网络、支持向量机、决策树、k近邻、集成学习、随机森林和自适应增强等七种不同的机器学习方法来预测细菌细胞浓度。收集了110个数据点的细菌细胞浓度,包括孵育时间、盐度、温度和酵母提取物,并用于制备这些模型。采用图形和统计比较来分析每个集成模型的性能和准确性。检索结果表明,训练后的集成学习模型是估计细菌生长最准确的方法,相关系数和均方误差分别为0.9163和0.0542。KNN模型的相关系数和均方误差分别为0.6111和0.1192,是7个估计器中最差的模型。该模型在训练阶段具有很高的准确性,而在验证和测试阶段则不准确。由此可见,KNN模型存在过拟合问题。此外,还利用敏感性分析确定了培养时间、酵母浸膏、温度和盐度对细菌细胞浓度的影响。结果表明,温度对微生物生长影响最大,酵母浸膏对微生物生长影响最小。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

An insight into the microorganism growth prediction by means of machine learning approaches

An insight into the microorganism growth prediction by means of machine learning approaches

Microbial enhanced oil recovery (MEOR) is a well-known oil recovery method that is greatly influenced by the growth and metabolism of the microorganisms. Given the complexities and uncertainties associated with identifying the growth mechanism of microorganism, developing an approach to estimate bacterial concentration versus different factors viz. Salinity, temperature and time is still deemed a challenge. Hence, in this study, seven different machine learning methods namely Artificial Neural Network, Support Vector Machine, Decision Tree, K-nearest Neighbors, Ensemble Learning, Random Forest and Adaptive Boosting are utilized to predict bacterial cell concentration. A databank including 110 data points of bacterial cell concentration entailing the incubation time, salinity, temperature and yeast extract has been collected and used for preparation of these models. Graphical and statistical comparisons are used to analyze the performance and accuracy of each integrated model. The retrieved results revealed that the trained ensemble learning model is the most accurate method in estimating the bacterial growth with correlation coefficient and mean squared error of 0.9163 and 0.0542 on the tested dataset, respectively. Moreover, the KNN model with correlation coefficient and mean squared error of 0.6111 and 0.1192, respectively, is the worst model among the seven estimators. This model has great accuracy in training phase while it is not accurate in validation and testing phase. Due to this fact, it can be concluded that KNN model suffers from overfitting problem. In addition, the impacts of incubation time, yeast extract, temperature and salinity on bacterial cell concentration are also ascertained using sensitivity analysis. It is discerned that the temperature and yeast extract are the most and least effective factors on growth of microorganism, respectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Petroleum Science and Engineering
Journal of Petroleum Science and Engineering 工程技术-地球科学综合
CiteScore
11.30
自引率
0.00%
发文量
1511
审稿时长
13.5 months
期刊介绍: The objective of the Journal of Petroleum Science and Engineering is to bridge the gap between the engineering, the geology and the science of petroleum and natural gas by publishing explicitly written articles intelligible to scientists and engineers working in any field of petroleum engineering, natural gas engineering and petroleum (natural gas) geology. An attempt is made in all issues to balance the subject matter and to appeal to a broad readership. The Journal of Petroleum Science and Engineering covers the fields of petroleum (and natural gas) exploration, production and flow in its broadest possible sense. Topics include: origin and accumulation of petroleum and natural gas; petroleum geochemistry; reservoir engineering; reservoir simulation; rock mechanics; petrophysics; pore-level phenomena; well logging, testing and evaluation; mathematical modelling; enhanced oil and gas recovery; petroleum geology; compaction/diagenesis; petroleum economics; drilling and drilling fluids; thermodynamics and phase behavior; fluid mechanics; multi-phase flow in porous media; production engineering; formation evaluation; exploration methods; CO2 Sequestration in geological formations/sub-surface; management and development of unconventional resources such as heavy oil and bitumen, tight oil and liquid rich shales.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信