{"title":"水的数字化身——化学与机器学习的结合","authors":"J. Farrell, S. Makarychev-Mikhailov","doi":"10.2118/213869-ms","DOIUrl":null,"url":null,"abstract":"\n Water affects almost every operation in the exploration and production (E&P) industry, with its properties important to flow assurance, to three-phase flow pressure/volume/temperature (PVT) modeling, and for fluid compatibility purposes across well construction, stimulation, and production operations. Until now, time-intensive laboratory tests or cumbersome third-party simulators were required to extract physicochemical properties. Here, a family of machine-learning-based reduced-order models (ROM), trained on rigorous first-principle thermodynamic simulation results, is presented.\n Approximately 90,000 representative produced-water samples were generated using the United States Geological Survey (USGS) Produced Waters Geochemical Database (Blondes et al. 2019), with systematic variation of the concentrations of 14 common ions. A training data set of 1 million rows was constructed, further varying temperatures and pressures using broad ranges (50-400°F and 14.7-20,000 psi). Thermodynamic simulations were used to generate a data set with more than 500 parameters, including speciation; physicochemical properties such as density, thermal conductivity, heat capacity, and salinity; and notably, the scaling potential for 11 common oilfield scale-forming minerals. More than 20 machine-learning algorithms were screened using cross-validation, and boosted decision trees were found to provide the best accuracy. The CatBoost algorithm (Prokhorenkova et al. 2018) was selected and further optimized. Model validation using unseen data showed relative errors of less than 1% for the majority of predicted properties, which is remarkable for such a complex multicomponent and multiphase system. Simulation details, modeling, and validation results are discussed.\n Trained and optimized ROMs can be incorporated in any workflow that depends on water property predictions. As a demonstration, a web application, Water Digital Avatar, was built from these ROMs to quickly and accurately process predictions of water properties and scaling potential on the basis of the entered water composition and desired conditions. The streamlined workflow provides users with model predictions in tabulated and graphical forms for analysis within the web application or offline by means of a downloaded spreadsheet.\n The developed ROMs that predict water properties enable automated decision making and improve water management workflows. The presented approach can be further extended to other oilfield, chemical, and chemical engineering applications.","PeriodicalId":241953,"journal":{"name":"Day 1 Wed, June 28, 2023","volume":"126 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Water Digital Avatar—Where Chemistry is Mixed with Machine Learning\",\"authors\":\"J. Farrell, S. Makarychev-Mikhailov\",\"doi\":\"10.2118/213869-ms\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Water affects almost every operation in the exploration and production (E&P) industry, with its properties important to flow assurance, to three-phase flow pressure/volume/temperature (PVT) modeling, and for fluid compatibility purposes across well construction, stimulation, and production operations. Until now, time-intensive laboratory tests or cumbersome third-party simulators were required to extract physicochemical properties. Here, a family of machine-learning-based reduced-order models (ROM), trained on rigorous first-principle thermodynamic simulation results, is presented.\\n Approximately 90,000 representative produced-water samples were generated using the United States Geological Survey (USGS) Produced Waters Geochemical Database (Blondes et al. 2019), with systematic variation of the concentrations of 14 common ions. A training data set of 1 million rows was constructed, further varying temperatures and pressures using broad ranges (50-400°F and 14.7-20,000 psi). Thermodynamic simulations were used to generate a data set with more than 500 parameters, including speciation; physicochemical properties such as density, thermal conductivity, heat capacity, and salinity; and notably, the scaling potential for 11 common oilfield scale-forming minerals. More than 20 machine-learning algorithms were screened using cross-validation, and boosted decision trees were found to provide the best accuracy. The CatBoost algorithm (Prokhorenkova et al. 2018) was selected and further optimized. Model validation using unseen data showed relative errors of less than 1% for the majority of predicted properties, which is remarkable for such a complex multicomponent and multiphase system. Simulation details, modeling, and validation results are discussed.\\n Trained and optimized ROMs can be incorporated in any workflow that depends on water property predictions. As a demonstration, a web application, Water Digital Avatar, was built from these ROMs to quickly and accurately process predictions of water properties and scaling potential on the basis of the entered water composition and desired conditions. The streamlined workflow provides users with model predictions in tabulated and graphical forms for analysis within the web application or offline by means of a downloaded spreadsheet.\\n The developed ROMs that predict water properties enable automated decision making and improve water management workflows. The presented approach can be further extended to other oilfield, chemical, and chemical engineering applications.\",\"PeriodicalId\":241953,\"journal\":{\"name\":\"Day 1 Wed, June 28, 2023\",\"volume\":\"126 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 1 Wed, June 28, 2023\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/213869-ms\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 1 Wed, June 28, 2023","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/213869-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
水几乎影响着勘探与生产(E&P)行业的每一项作业,它的特性对流动保证、三相流动压力/体积/温度(PVT)建模以及在建井、增产和生产作业中的流体相容性都很重要。到目前为止,需要耗时的实验室测试或繁琐的第三方模拟器来提取物理化学性质。在这里,提出了一系列基于机器学习的降阶模型(ROM),这些模型是在严格的第一原理热力学模拟结果上训练出来的。使用美国地质调查局(USGS)采出水地球化学数据库(Blondes et al. 2019)生成了大约90,000个具有代表性的采出水样本,其中包含14种常见离子浓度的系统变化。建立了一个100万行的训练数据集,进一步改变了温度和压力的范围(50-400°F和14.7-20,000 psi)。热力学模拟生成了一个包含500多个参数的数据集,包括物种形成;物理化学性质,如密度、导热系数、热容和盐度;值得注意的是,11种常见的油田结垢矿物具有结垢潜力。使用交叉验证筛选了20多种机器学习算法,发现增强决策树提供了最好的准确性。选择CatBoost算法(Prokhorenkova et al. 2018)并进一步优化。使用未见数据的模型验证显示,大多数预测属性的相对误差小于1%,这对于这样一个复杂的多组分和多相系统来说是显着的。讨论了仿真细节、建模和验证结果。经过训练和优化的rom可以应用于任何基于水属性预测的工作流程中。作为演示,基于这些rom构建了一个web应用程序Water Digital Avatar,该应用程序可以根据输入的水成分和所需条件快速准确地处理水性质和结垢潜力的预测。简化的工作流程为用户提供表格和图形形式的模型预测,以便在web应用程序内或通过下载的电子表格离线进行分析。开发的预测水属性的rom实现了自动化决策,并改善了水管理工作流程。所提出的方法可以进一步推广到其他油田、化工和化工应用中。
Water Digital Avatar—Where Chemistry is Mixed with Machine Learning
Water affects almost every operation in the exploration and production (E&P) industry, with its properties important to flow assurance, to three-phase flow pressure/volume/temperature (PVT) modeling, and for fluid compatibility purposes across well construction, stimulation, and production operations. Until now, time-intensive laboratory tests or cumbersome third-party simulators were required to extract physicochemical properties. Here, a family of machine-learning-based reduced-order models (ROM), trained on rigorous first-principle thermodynamic simulation results, is presented.
Approximately 90,000 representative produced-water samples were generated using the United States Geological Survey (USGS) Produced Waters Geochemical Database (Blondes et al. 2019), with systematic variation of the concentrations of 14 common ions. A training data set of 1 million rows was constructed, further varying temperatures and pressures using broad ranges (50-400°F and 14.7-20,000 psi). Thermodynamic simulations were used to generate a data set with more than 500 parameters, including speciation; physicochemical properties such as density, thermal conductivity, heat capacity, and salinity; and notably, the scaling potential for 11 common oilfield scale-forming minerals. More than 20 machine-learning algorithms were screened using cross-validation, and boosted decision trees were found to provide the best accuracy. The CatBoost algorithm (Prokhorenkova et al. 2018) was selected and further optimized. Model validation using unseen data showed relative errors of less than 1% for the majority of predicted properties, which is remarkable for such a complex multicomponent and multiphase system. Simulation details, modeling, and validation results are discussed.
Trained and optimized ROMs can be incorporated in any workflow that depends on water property predictions. As a demonstration, a web application, Water Digital Avatar, was built from these ROMs to quickly and accurately process predictions of water properties and scaling potential on the basis of the entered water composition and desired conditions. The streamlined workflow provides users with model predictions in tabulated and graphical forms for analysis within the web application or offline by means of a downloaded spreadsheet.
The developed ROMs that predict water properties enable automated decision making and improve water management workflows. The presented approach can be further extended to other oilfield, chemical, and chemical engineering applications.