Research on Intelligent and High-Precision Structure-Recognition Methods for Field Geological Outcrop Images

IF 2.2 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IET Image Processing Pub Date : 2025-04-22 DOI:10.1049/ipr2.70087

Mingguang Diao, Kaixuan Liu, Shupeng Wang, Chuyan Zhang

{"title":"Research on Intelligent and High-Precision Structure-Recognition Methods for Field Geological Outcrop Images","authors":"Mingguang Diao, Kaixuan Liu, Shupeng Wang, Chuyan Zhang","doi":"10.1049/ipr2.70087","DOIUrl":null,"url":null,"abstract":"<p>The accurate recognition of geological structures in field outcrop images is critical for applications such as geological hazard analysis, seismic risk assessment, and urban geological planning. However, traditional manual interpretation of geological images is time-consuming, labor-intensive, and subjective, limiting its scalability and precision. To address this gap, this study proposes an intelligent, automated recognition method for field geological outcrop images based on deep learning techniques. The methodology integrates Fourier transform, Canny edge detection, and Mask R-CNN instance segmentation, enhanced with image normalization and data augmentation strategies such as grayscale conversion, Gaussian filtering, and rotation. A custom dataset comprising 4260 images was constructed and annotated using a hybrid approach involving edge detection and expert labeling. The proposed model, improved with PrRoI Pooling, outperforms conventional models such as YOLOv3, Faster R-CNN, and standard Mask R-CNN, achieving a mean average precision (mAP) of 90.77% in detecting fault, fold, and sausage-like geological structures. The results demonstrate the model's robustness, accuracy, and suitability for complex geological environments. This study not only advances the state-of-the-art in geological image recognition but also lays a foundation for future research into broader structural classification, multi-modal geological data integration, and real-time field deployment.</p>","PeriodicalId":56303,"journal":{"name":"IET Image Processing","volume":"19 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ipr2.70087","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Image Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ipr2.70087","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The accurate recognition of geological structures in field outcrop images is critical for applications such as geological hazard analysis, seismic risk assessment, and urban geological planning. However, traditional manual interpretation of geological images is time-consuming, labor-intensive, and subjective, limiting its scalability and precision. To address this gap, this study proposes an intelligent, automated recognition method for field geological outcrop images based on deep learning techniques. The methodology integrates Fourier transform, Canny edge detection, and Mask R-CNN instance segmentation, enhanced with image normalization and data augmentation strategies such as grayscale conversion, Gaussian filtering, and rotation. A custom dataset comprising 4260 images was constructed and annotated using a hybrid approach involving edge detection and expert labeling. The proposed model, improved with PrRoI Pooling, outperforms conventional models such as YOLOv3, Faster R-CNN, and standard Mask R-CNN, achieving a mean average precision (mAP) of 90.77% in detecting fault, fold, and sausage-like geological structures. The results demonstrate the model's robustness, accuracy, and suitability for complex geological environments. This study not only advances the state-of-the-art in geological image recognition but also lays a foundation for future research into broader structural classification, multi-modal geological data integration, and real-time field deployment.

Abstract Image

查看原文本刊更多论文

野外地质外业图像的智能高精度结构识别方法研究

准确识别野外露头图像中的地质结构对于地质灾害分析、地震风险评估和城市地质规划等应用至关重要。然而，传统的地质图像人工判读耗时、耗力、主观性强，限制了其可扩展性和精确性。针对这一不足，本研究提出了一种基于深度学习技术的野外地质露头图像智能自动识别方法。该方法集成了傅立叶变换、Canny 边缘检测和掩码 R-CNN 实例分割，并通过灰度转换、高斯滤波和旋转等图像归一化和数据增强策略加以强化。利用边缘检测和专家标注的混合方法，构建了一个包含 4260 幅图像的自定义数据集，并对其进行了标注。在检测断层、褶皱和类似香肠的地质结构方面，利用 PrRoI Pooling 改进后的模型优于 YOLOv3、Faster R-CNN 和标准掩膜 R-CNN 等传统模型，平均精确度 (mAP) 达到 90.77%。这些结果证明了该模型的鲁棒性、准确性和对复杂地质环境的适用性。这项研究不仅推动了地质图像识别技术的发展，还为今后研究更广泛的结构分类、多模式地质数据集成和实时现场部署奠定了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IET Image Processing 工程技术-工程：电子与电气

CiteScore

5.40

自引率

8.70%

发文量

282

审稿时长

6 months

期刊介绍： The IET Image Processing journal encompasses research areas related to the generation, processing and communication of visual information. The focus of the journal is the coverage of the latest research results in image and video processing, including image generation and display, enhancement and restoration, segmentation, colour and texture analysis, coding and communication, implementations and architectures as well as innovative applications. Principal topics include: Generation and Display - Imaging sensors and acquisition systems, illumination, sampling and scanning, quantization, colour reproduction, image rendering, display and printing systems, evaluation of image quality. Processing and Analysis - Image enhancement, restoration, segmentation, registration, multispectral, colour and texture processing, multiresolution processing and wavelets, morphological operations, stereoscopic and 3-D processing, motion detection and estimation, video and image sequence processing. Implementations and Architectures - Image and video processing hardware and software, design and construction, architectures and software, neural, adaptive, and fuzzy processing. Coding and Transmission - Image and video compression and coding, compression standards, noise modelling, visual information networks, streamed video. Retrieval and Multimedia - Storage of images and video, database design, image retrieval, video annotation and editing, mixed media incorporating visual information, multimedia systems and applications, image and video watermarking, steganography. Applications - Innovative application of image and video processing technologies to any field, including life sciences, earth sciences, astronomy, document processing and security. Current Special Issue Call for Papers: Evolutionary Computation for Image Processing - https://digital-library.theiet.org/files/IET_IPR_CFP_EC.pdf AI-Powered 3D Vision - https://digital-library.theiet.org/files/IET_IPR_CFP_AIPV.pdf Multidisciplinary advancement of Imaging Technologies: From Medical Diagnostics and Genomics to Cognitive Machine Vision, and Artificial Intelligence - https://digital-library.theiet.org/files/IET_IPR_CFP_IST.pdf Deep Learning for 3D Reconstruction - https://digital-library.theiet.org/files/IET_IPR_CFP_DLR.pdf