Ran Jing , Yanlin Shao , Qihong Zeng , Yuangang Liu , Wei Wei , Binqing Gan , Xiaolei Duan
{"title":"Multimodal feature integration network for lithology identification from point cloud data","authors":"Ran Jing , Yanlin Shao , Qihong Zeng , Yuangang Liu , Wei Wei , Binqing Gan , Xiaolei Duan","doi":"10.1016/j.cageo.2024.105775","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate lithology identification from outcrop surfaces is crucial for interpreting geological 3D data. However, challenges arise due to factors such as severe weathering and vegetation coverage, which hinder achieving ideal identification results with both accuracy and efficiency. The integration of 3D point cloud technology and deep learning methodologies presents a promising solution to address these challenges. In this study, we propose a novel multimodal feature integration network designed to distinguish various rock types from point clouds. Our network incorporates a multimodal feature integration block equipped with multiple attention mechanisms to extract representative deep features, along with a hierarchical feature separation block to leverage these features for precise segmentation of points corresponding to different lithologies. Furthermore, we introduce a specialized loss function tailored for rock type identification to enhance network training. Through experiments involving point cloud sampling strategies and loss function evaluation, we identify the optimal network configuration. Comparative analyses against baseline methods demonstrate the superiority of our proposed network across diverse study areas reconstructed from UAV images and laser scanner data, exhibiting improved visual appearance and metric values (Accuracy = 0.978, mean Accuracy = 0.895, mean IoU = 0.857). These findings underscore the efficacy of the multimodal feature integration network as a promising approach for lithology identification tasks in various digital outcrop models derived from heterogeneous data sources.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"194 ","pages":"Article 105775"},"PeriodicalIF":4.2000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300424002589","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate lithology identification from outcrop surfaces is crucial for interpreting geological 3D data. However, challenges arise due to factors such as severe weathering and vegetation coverage, which hinder achieving ideal identification results with both accuracy and efficiency. The integration of 3D point cloud technology and deep learning methodologies presents a promising solution to address these challenges. In this study, we propose a novel multimodal feature integration network designed to distinguish various rock types from point clouds. Our network incorporates a multimodal feature integration block equipped with multiple attention mechanisms to extract representative deep features, along with a hierarchical feature separation block to leverage these features for precise segmentation of points corresponding to different lithologies. Furthermore, we introduce a specialized loss function tailored for rock type identification to enhance network training. Through experiments involving point cloud sampling strategies and loss function evaluation, we identify the optimal network configuration. Comparative analyses against baseline methods demonstrate the superiority of our proposed network across diverse study areas reconstructed from UAV images and laser scanner data, exhibiting improved visual appearance and metric values (Accuracy = 0.978, mean Accuracy = 0.895, mean IoU = 0.857). These findings underscore the efficacy of the multimodal feature integration network as a promising approach for lithology identification tasks in various digital outcrop models derived from heterogeneous data sources.
期刊介绍:
Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.