Tianrui Ye , Jin Meng , Yitian Xiao , Yaqiu Lu , Aiwei Zheng , Bang Liang
{"title":"基于集成automl的页岩气生产优化框架——以涪陵页岩气田为例","authors":"Tianrui Ye , Jin Meng , Yitian Xiao , Yaqiu Lu , Aiwei Zheng , Bang Liang","doi":"10.1016/j.engeos.2024.100365","DOIUrl":null,"url":null,"abstract":"<div><div>This study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the estimated ultimate recovery (<em>EUR</em>) of shale gas wells. To demystify the “black-box” nature of the ensemble model, KernelSHAP, a kernel-based approach to compute Shapley values, is utilized for elucidating the influential factors that affect shale gas production at both global and local scales. Furthermore, a bi-objective optimization algorithm named NSGA-II is seamlessly incorporated to optimize hydraulic fracturing designs for production boost and cost control. This innovative framework addresses critical limitations often encountered in applying machine learning (ML) to shale gas production: the challenge of achieving sufficient model accuracy with limited samples, the multidisciplinary expertise required for developing robust ML models, and the need for interpretability in “black-box” models. Validation with field data from the Fuling shale gas field in the Sichuan Basin substantiates the framework's efficacy in enhancing the precision and applicability of data-driven techniques. The test accuracy of the ensemble ML model reached 83 % compared to a maximum of 72 % of single ML models. The contribution of each geological and engineering factor to the overall production was quantitatively evaluated. Fracturing design optimization raised <em>EUR</em> by 7 %–34 % under different production and cost tradeoff scenarios. The results empower domain experts to conduct more precise and objective data-driven analyses and optimizations for shale gas production with minimal expertise in data science.</div></div>","PeriodicalId":100469,"journal":{"name":"Energy Geoscience","volume":"6 1","pages":"Article 100365"},"PeriodicalIF":3.6000,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field\",\"authors\":\"Tianrui Ye , Jin Meng , Yitian Xiao , Yaqiu Lu , Aiwei Zheng , Bang Liang\",\"doi\":\"10.1016/j.engeos.2024.100365\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the estimated ultimate recovery (<em>EUR</em>) of shale gas wells. To demystify the “black-box” nature of the ensemble model, KernelSHAP, a kernel-based approach to compute Shapley values, is utilized for elucidating the influential factors that affect shale gas production at both global and local scales. Furthermore, a bi-objective optimization algorithm named NSGA-II is seamlessly incorporated to optimize hydraulic fracturing designs for production boost and cost control. This innovative framework addresses critical limitations often encountered in applying machine learning (ML) to shale gas production: the challenge of achieving sufficient model accuracy with limited samples, the multidisciplinary expertise required for developing robust ML models, and the need for interpretability in “black-box” models. Validation with field data from the Fuling shale gas field in the Sichuan Basin substantiates the framework's efficacy in enhancing the precision and applicability of data-driven techniques. The test accuracy of the ensemble ML model reached 83 % compared to a maximum of 72 % of single ML models. The contribution of each geological and engineering factor to the overall production was quantitatively evaluated. Fracturing design optimization raised <em>EUR</em> by 7 %–34 % under different production and cost tradeoff scenarios. The results empower domain experts to conduct more precise and objective data-driven analyses and optimizations for shale gas production with minimal expertise in data science.</div></div>\",\"PeriodicalId\":100469,\"journal\":{\"name\":\"Energy Geoscience\",\"volume\":\"6 1\",\"pages\":\"Article 100365\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2024-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy Geoscience\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666759224000805\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Geoscience","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666759224000805","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Integrated AutoML-based framework for optimizing shale gas production: A case study of the Fuling shale gas field
This study introduces a comprehensive and automated framework that leverages data-driven methodologies to address various challenges in shale gas development and production. Specifically, it harnesses the power of Automated Machine Learning (AutoML) to construct an ensemble model to predict the estimated ultimate recovery (EUR) of shale gas wells. To demystify the “black-box” nature of the ensemble model, KernelSHAP, a kernel-based approach to compute Shapley values, is utilized for elucidating the influential factors that affect shale gas production at both global and local scales. Furthermore, a bi-objective optimization algorithm named NSGA-II is seamlessly incorporated to optimize hydraulic fracturing designs for production boost and cost control. This innovative framework addresses critical limitations often encountered in applying machine learning (ML) to shale gas production: the challenge of achieving sufficient model accuracy with limited samples, the multidisciplinary expertise required for developing robust ML models, and the need for interpretability in “black-box” models. Validation with field data from the Fuling shale gas field in the Sichuan Basin substantiates the framework's efficacy in enhancing the precision and applicability of data-driven techniques. The test accuracy of the ensemble ML model reached 83 % compared to a maximum of 72 % of single ML models. The contribution of each geological and engineering factor to the overall production was quantitatively evaluated. Fracturing design optimization raised EUR by 7 %–34 % under different production and cost tradeoff scenarios. The results empower domain experts to conduct more precise and objective data-driven analyses and optimizations for shale gas production with minimal expertise in data science.