Yakun Ju;Ling Li;Xian Zhong;Yuan Rao;Yanru Liu;Junyu Dong;Alex C. Kot
{"title":"Underwater Surface Normal Reconstruction via Cross-Grained Photometric Stereo Transformer","authors":"Yakun Ju;Ling Li;Xian Zhong;Yuan Rao;Yanru Liu;Junyu Dong;Alex C. Kot","doi":"10.1109/JOE.2024.3458110","DOIUrl":null,"url":null,"abstract":"Modern ocean research necessitates high-precision 3-D underwater data acquisition. Photometric stereo is a critical technique for recovering high-resolution, dense surface normals of textureless objects, such as the seabed and underwater pipelines. This technique is fundamental for underwater robots engaged in ocean exploration and operational tasks. Traditional underwater photometric stereo methods account for distributed underwater media, such as light scattering. However, the deployment of devices in complex underwater environments (e.g., ocean currents) often results in misalignment and jitter among photometric stereo images. These challenges lead to inaccuracies in matching-based methods, particularly due to the lack of texture and varying illumination conditions. To address these issues, we propose the Cross-Grained Transformer Photometric Stereo (CGT-PS) Network. CGT-PS is designed to directly manage misaligned pixels caused by underwater jitter in an end-to-end manner. The proposed method consists of two main components: the local-grained and global-grained modules. The local-grained module utilizes a Shift operation to adjust pixels within a single-pixel span, effectively mitigating misalignment caused by motion without increasing computational cost. In contrast, the global-grained module performs nonlocal fusion learning, leveraging distant features to enhance the extraction of intricate structural details, cast shadows, and interreflection regions. Ablation studies confirm the efficacy of the proposed modules. Extensive experiments on photometric stereo benchmark data sets and real underwater photometric stereo samples demonstrate that our method achieves superior performance.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 1","pages":"192-203"},"PeriodicalIF":3.8000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10721286/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Modern ocean research necessitates high-precision 3-D underwater data acquisition. Photometric stereo is a critical technique for recovering high-resolution, dense surface normals of textureless objects, such as the seabed and underwater pipelines. This technique is fundamental for underwater robots engaged in ocean exploration and operational tasks. Traditional underwater photometric stereo methods account for distributed underwater media, such as light scattering. However, the deployment of devices in complex underwater environments (e.g., ocean currents) often results in misalignment and jitter among photometric stereo images. These challenges lead to inaccuracies in matching-based methods, particularly due to the lack of texture and varying illumination conditions. To address these issues, we propose the Cross-Grained Transformer Photometric Stereo (CGT-PS) Network. CGT-PS is designed to directly manage misaligned pixels caused by underwater jitter in an end-to-end manner. The proposed method consists of two main components: the local-grained and global-grained modules. The local-grained module utilizes a Shift operation to adjust pixels within a single-pixel span, effectively mitigating misalignment caused by motion without increasing computational cost. In contrast, the global-grained module performs nonlocal fusion learning, leveraging distant features to enhance the extraction of intricate structural details, cast shadows, and interreflection regions. Ablation studies confirm the efficacy of the proposed modules. Extensive experiments on photometric stereo benchmark data sets and real underwater photometric stereo samples demonstrate that our method achieves superior performance.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.