基于机器学习的碳酸盐岩储层胶结系数预测特征选择方法对比分析

Day 2 Wed, March 27, 2019 Pub Date : 2019-03-22 DOI:10.2523/IPTC-19330-MS

F. Anifowose, C. Ayadiuno, Faisal Rashedan

{"title":"基于机器学习的碳酸盐岩储层胶结系数预测特征选择方法对比分析","authors":"F. Anifowose, C. Ayadiuno, Faisal Rashedan","doi":"10.2523/IPTC-19330-MS","DOIUrl":null,"url":null,"abstract":"\n A key component of the fourth industrial revolution is data integration. However, this comes with a major challenge: handling increased input feature dimensionality. Multivariate feature space increases model complexity, memory utilization, and computational intensity, thereby reducing model performance. A pragmatic approach to input feature space reduction is therefore required. This paper presents a comparative study of the performance of a nonlinear feature selection methodology based on fuzzy ranking (FR). The FR algorithm is extracted from a segment of Fuzzy Logic, an existing machine learning technique. The performance of this feature selection algorithm is tested and validated with respect to the prediction of cementation factor as a log from wireline measurements using machine learning techniques. Cementation factor is denoted by the exponent m in Archie's equation. A subset of the log data selected by the FR algorithm is automatically fed into artificial neural network (ANN) and support vector machine (SVM) models to build FR-ANN and FR-SVM hybrid learning models. A multivariate linear regression (MLR) model is also implemented. The performance of the hybrid models is compared to those of MLR, ANN and SVM without the feature selection procedure. We further compare the outcome with ANN and SVM fed with linearly correlated input features. The hybrid learning methodology is driven by patterns discovered in the data and eliminates subjective human bias in the choice of the input features. It also takes into consideration the possible nonlinear relationship between the wireline logs and m. The FR-ANN model shows improved performance over the other models with the highest correlation coefficient and lowest root mean squared error. The performance of the FR-ANN hybrid model demonstrates the efficiency of the proposed nonlinear feature selection hybrid methodology. A future work will apply this approach to high dimensional, integrated data types from many wells. We expect that the outcome will significantly improve the prediction accuracy and further impact reservoir models using the predicted properties.","PeriodicalId":105730,"journal":{"name":"Day 2 Wed, March 27, 2019","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Comparative Analysis of Machine Learning Based Feature Selection Approach for Carbonate Reservoir Cementation Factor Prediction\",\"authors\":\"F. Anifowose, C. Ayadiuno, Faisal Rashedan\",\"doi\":\"10.2523/IPTC-19330-MS\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n A key component of the fourth industrial revolution is data integration. However, this comes with a major challenge: handling increased input feature dimensionality. Multivariate feature space increases model complexity, memory utilization, and computational intensity, thereby reducing model performance. A pragmatic approach to input feature space reduction is therefore required. This paper presents a comparative study of the performance of a nonlinear feature selection methodology based on fuzzy ranking (FR). The FR algorithm is extracted from a segment of Fuzzy Logic, an existing machine learning technique. The performance of this feature selection algorithm is tested and validated with respect to the prediction of cementation factor as a log from wireline measurements using machine learning techniques. Cementation factor is denoted by the exponent m in Archie's equation. A subset of the log data selected by the FR algorithm is automatically fed into artificial neural network (ANN) and support vector machine (SVM) models to build FR-ANN and FR-SVM hybrid learning models. A multivariate linear regression (MLR) model is also implemented. The performance of the hybrid models is compared to those of MLR, ANN and SVM without the feature selection procedure. We further compare the outcome with ANN and SVM fed with linearly correlated input features. The hybrid learning methodology is driven by patterns discovered in the data and eliminates subjective human bias in the choice of the input features. It also takes into consideration the possible nonlinear relationship between the wireline logs and m. The FR-ANN model shows improved performance over the other models with the highest correlation coefficient and lowest root mean squared error. The performance of the FR-ANN hybrid model demonstrates the efficiency of the proposed nonlinear feature selection hybrid methodology. A future work will apply this approach to high dimensional, integrated data types from many wells. We expect that the outcome will significantly improve the prediction accuracy and further impact reservoir models using the predicted properties.\",\"PeriodicalId\":105730,\"journal\":{\"name\":\"Day 2 Wed, March 27, 2019\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 2 Wed, March 27, 2019\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2523/IPTC-19330-MS\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Wed, March 27, 2019","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2523/IPTC-19330-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

第四次工业革命的一个关键组成部分是数据集成。然而，这带来了一个主要的挑战:处理增加的输入特征维度。多变量特征空间增加了模型的复杂性、内存利用率和计算强度，从而降低了模型的性能。因此，需要一种实用的方法来减少输入特征空间。本文对基于模糊排序的非线性特征选择方法的性能进行了比较研究。FR算法是从模糊逻辑的一个片段中提取的，模糊逻辑是一种现有的机器学习技术。该特征选择算法的性能经过了测试和验证，并使用机器学习技术从电缆测量数据中预测胶结系数。胶结系数用阿尔奇方程中的指数m表示。将FR算法选择的测井数据子集自动输入到人工神经网络(ANN)和支持向量机(SVM)模型中，构建FR-ANN和FR-SVM混合学习模型。多元线性回归(MLR)模型也被实现。将混合模型的性能与未加特征选择的MLR、ANN和SVM进行了比较。我们进一步将结果与具有线性相关输入特征的人工神经网络和支持向量机进行比较。混合学习方法由数据中发现的模式驱动，消除了人类在选择输入特征时的主观偏见。它还考虑了电缆测井曲线与m之间可能存在的非线性关系。与其他相关系数最高、均方根误差最低的模型相比，FR-ANN模型表现出更好的性能。该混合模型的性能证明了所提出的非线性特征选择混合方法的有效性。未来的工作将把这种方法应用于来自许多井的高维、集成数据类型。我们期望这一结果将显著提高预测精度，并利用预测的性质进一步影响储层模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Comparative Analysis of Machine Learning Based Feature Selection Approach for Carbonate Reservoir Cementation Factor Prediction

A key component of the fourth industrial revolution is data integration. However, this comes with a major challenge: handling increased input feature dimensionality. Multivariate feature space increases model complexity, memory utilization, and computational intensity, thereby reducing model performance. A pragmatic approach to input feature space reduction is therefore required. This paper presents a comparative study of the performance of a nonlinear feature selection methodology based on fuzzy ranking (FR). The FR algorithm is extracted from a segment of Fuzzy Logic, an existing machine learning technique. The performance of this feature selection algorithm is tested and validated with respect to the prediction of cementation factor as a log from wireline measurements using machine learning techniques. Cementation factor is denoted by the exponent m in Archie's equation. A subset of the log data selected by the FR algorithm is automatically fed into artificial neural network (ANN) and support vector machine (SVM) models to build FR-ANN and FR-SVM hybrid learning models. A multivariate linear regression (MLR) model is also implemented. The performance of the hybrid models is compared to those of MLR, ANN and SVM without the feature selection procedure. We further compare the outcome with ANN and SVM fed with linearly correlated input features. The hybrid learning methodology is driven by patterns discovered in the data and eliminates subjective human bias in the choice of the input features. It also takes into consideration the possible nonlinear relationship between the wireline logs and m. The FR-ANN model shows improved performance over the other models with the highest correlation coefficient and lowest root mean squared error. The performance of the FR-ANN hybrid model demonstrates the efficiency of the proposed nonlinear feature selection hybrid methodology. A future work will apply this approach to high dimensional, integrated data types from many wells. We expect that the outcome will significantly improve the prediction accuracy and further impact reservoir models using the predicted properties.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Day 2 Wed, March 27, 2019

自引率

0.00%

发文量