Lei Lin , Zhi Zhong , Chenglong Li , Andrew Gorman , Hao Wei , Yanbin Kuang , Shiqi Wen , Zhongxian Cai , Fang Hao
{"title":"Machine learning for subsurface geological feature identification from seismic data: Methods, datasets, challenges, and opportunities","authors":"Lei Lin , Zhi Zhong , Chenglong Li , Andrew Gorman , Hao Wei , Yanbin Kuang , Shiqi Wen , Zhongxian Cai , Fang Hao","doi":"10.1016/j.earscirev.2024.104887","DOIUrl":null,"url":null,"abstract":"<div><p>Identification of geological features from seismic data such as faults, salt bodies, and channels, is essential for studies of the shallow Earth, natural disaster forecasting and evaluation, carbon capture and storage, hydrogen storage, geothermal energy development, and traditional resource exploration. However, manual seismic interpretation is distinctly subjective and labor-intensive. With the advent and rise of 3D surveys, the size of seismic data has increased dramatically, making purely manual interpretation impractical. Since 1989, a large number of machine learning-based methods for identifying geological features have been proposed to address these challenges. To date, these methods have not been reasonably synthesized. Motivated by a progressive increase in applications, this review presents an overview of advances in the utilization of machine learning to identify geological features from seismic data. First, we classify these methods from five different perspectives. Second, we provide a comprehensive overview of 241 publications related to seismic geological feature identification and offer a detailed analysis of the development of these methods categorized by geological feature type. Third, 20 field and 12 synthetic seismic datasets, which are publicly available and relevant to the identification of faults, salt bodies, channels, caves, and horizons, are cataloged. Fourth, we discuss the issue of false positive identification caused by the limited geological features in the training dataset. To address the problems of false positives and insufficient labeled training datasets, we propose a simulation framework for generating 3D synthetic seismic data and corresponding geological labels that include a rich variety of geological features. To the best of our knowledge, this is the synthetic seismic dataset that contains the richest geological features. Finally, we discuss in depth the current challenges and future opportunities to inspire further relevant research.</p></div>","PeriodicalId":11483,"journal":{"name":"Earth-Science Reviews","volume":"257 ","pages":"Article 104887"},"PeriodicalIF":10.8000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth-Science Reviews","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0012825224002149","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Identification of geological features from seismic data such as faults, salt bodies, and channels, is essential for studies of the shallow Earth, natural disaster forecasting and evaluation, carbon capture and storage, hydrogen storage, geothermal energy development, and traditional resource exploration. However, manual seismic interpretation is distinctly subjective and labor-intensive. With the advent and rise of 3D surveys, the size of seismic data has increased dramatically, making purely manual interpretation impractical. Since 1989, a large number of machine learning-based methods for identifying geological features have been proposed to address these challenges. To date, these methods have not been reasonably synthesized. Motivated by a progressive increase in applications, this review presents an overview of advances in the utilization of machine learning to identify geological features from seismic data. First, we classify these methods from five different perspectives. Second, we provide a comprehensive overview of 241 publications related to seismic geological feature identification and offer a detailed analysis of the development of these methods categorized by geological feature type. Third, 20 field and 12 synthetic seismic datasets, which are publicly available and relevant to the identification of faults, salt bodies, channels, caves, and horizons, are cataloged. Fourth, we discuss the issue of false positive identification caused by the limited geological features in the training dataset. To address the problems of false positives and insufficient labeled training datasets, we propose a simulation framework for generating 3D synthetic seismic data and corresponding geological labels that include a rich variety of geological features. To the best of our knowledge, this is the synthetic seismic dataset that contains the richest geological features. Finally, we discuss in depth the current challenges and future opportunities to inspire further relevant research.
期刊介绍:
Covering a much wider field than the usual specialist journals, Earth Science Reviews publishes review articles dealing with all aspects of Earth Sciences, and is an important vehicle for allowing readers to see their particular interest related to the Earth Sciences as a whole.