{"title":"基于集成集成学习方法的地下天然裂缝识别","authors":"Guoqing Lu;Lianbo Zeng;Guoping Liu;Xiaoxuan Chen;Mehdi Ostadhassan;Xiaoyu Du;Yangkang Chen","doi":"10.1109/TGRS.2024.3525191","DOIUrl":null,"url":null,"abstract":"Natural fractures play a crucial role in the storage and seepage of oil shale. However, identifying fractures using conventional logging techniques presents challenges due to complex response characteristics and severe data imbalance. Here, we propose a highly accurate integrated ensemble learning method, called BSI-extreme gradient boosting (XGBoost), for identifying the natural fracture development, which combines several steps including isolation forests (iForests), synthetic minority oversampling techniques (SMOTEs), and XGBoost, and incorporates rock brittleness as a controlling factor in the model construction process. The proposed model effectively addresses several challenges encountered in fracture identification, including complex logging response characteristics, low precision and recall of fractured labels, and excessive sensitivity of ensemble learning to noise. To do so, the relationship between fracture density and brittle mineral content is analyzed through core analysis and X-ray diffraction (XRD). Then, conventional logging and rock brittleness are used as features for training the model. Herein, by screening the outliers of iForest, SMOTE oversampling, and feature selection, optimal hyperparameters of the model are obtained through the grid search method. The results demonstrated that using BSI-XGBoost, the testing set achieved an accuracy of 92.45%. Comparatively, this accuracy is 4.86% higher than the original XGBoost model and 3.73% higher than the B-XGBoost model, which incorporated brittleness curves but did not include oversampling and outlier removal. Collectively, this workflow provided an effective method for intelligent identification of fractures in oil shale with high accuracy based on easily accessible conventional logging curves.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-16"},"PeriodicalIF":8.6000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Subsurface Natural Fracture Identification Using an Integrated Ensemble Learning Method\",\"authors\":\"Guoqing Lu;Lianbo Zeng;Guoping Liu;Xiaoxuan Chen;Mehdi Ostadhassan;Xiaoyu Du;Yangkang Chen\",\"doi\":\"10.1109/TGRS.2024.3525191\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Natural fractures play a crucial role in the storage and seepage of oil shale. However, identifying fractures using conventional logging techniques presents challenges due to complex response characteristics and severe data imbalance. Here, we propose a highly accurate integrated ensemble learning method, called BSI-extreme gradient boosting (XGBoost), for identifying the natural fracture development, which combines several steps including isolation forests (iForests), synthetic minority oversampling techniques (SMOTEs), and XGBoost, and incorporates rock brittleness as a controlling factor in the model construction process. The proposed model effectively addresses several challenges encountered in fracture identification, including complex logging response characteristics, low precision and recall of fractured labels, and excessive sensitivity of ensemble learning to noise. To do so, the relationship between fracture density and brittle mineral content is analyzed through core analysis and X-ray diffraction (XRD). Then, conventional logging and rock brittleness are used as features for training the model. Herein, by screening the outliers of iForest, SMOTE oversampling, and feature selection, optimal hyperparameters of the model are obtained through the grid search method. The results demonstrated that using BSI-XGBoost, the testing set achieved an accuracy of 92.45%. Comparatively, this accuracy is 4.86% higher than the original XGBoost model and 3.73% higher than the B-XGBoost model, which incorporated brittleness curves but did not include oversampling and outlier removal. Collectively, this workflow provided an effective method for intelligent identification of fractures in oil shale with high accuracy based on easily accessible conventional logging curves.\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"63 \",\"pages\":\"1-16\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10820892/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10820892/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Subsurface Natural Fracture Identification Using an Integrated Ensemble Learning Method
Natural fractures play a crucial role in the storage and seepage of oil shale. However, identifying fractures using conventional logging techniques presents challenges due to complex response characteristics and severe data imbalance. Here, we propose a highly accurate integrated ensemble learning method, called BSI-extreme gradient boosting (XGBoost), for identifying the natural fracture development, which combines several steps including isolation forests (iForests), synthetic minority oversampling techniques (SMOTEs), and XGBoost, and incorporates rock brittleness as a controlling factor in the model construction process. The proposed model effectively addresses several challenges encountered in fracture identification, including complex logging response characteristics, low precision and recall of fractured labels, and excessive sensitivity of ensemble learning to noise. To do so, the relationship between fracture density and brittle mineral content is analyzed through core analysis and X-ray diffraction (XRD). Then, conventional logging and rock brittleness are used as features for training the model. Herein, by screening the outliers of iForest, SMOTE oversampling, and feature selection, optimal hyperparameters of the model are obtained through the grid search method. The results demonstrated that using BSI-XGBoost, the testing set achieved an accuracy of 92.45%. Comparatively, this accuracy is 4.86% higher than the original XGBoost model and 3.73% higher than the B-XGBoost model, which incorporated brittleness curves but did not include oversampling and outlier removal. Collectively, this workflow provided an effective method for intelligent identification of fractures in oil shale with high accuracy based on easily accessible conventional logging curves.
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.