{"title":"Cascade deep polarization network for precise image semantic segmentation.","authors":"Jinyu Zhang, Xu Ma, Weili Chen, Hantang Chen, Gonzalo R Arce","doi":"10.1364/AO.561465","DOIUrl":null,"url":null,"abstract":"<p><p>Optical polarization imaging technology provides multi-dimensional light field information, encompassing spatial details and polarization data, which can be exploited for image semantic segmentation for target scene analysis. Most recent works focus on the development of neural networks with separate simple preprocessing steps to deal with the raw polarization images, which limit the accuracy of semantic segmentation. This paper proposes a novel, to the best of our knowledge, method, dubbed cascade deep polarization network (CDPN), to improve the performance of semantic segmentation by integrating preprocessing modules directly into the end-to-end deep learning work. The raw input data include the angle of linear polarization, degree of linear polarization, and a set of Stokes parameters. The multi-dimensional feature maps are extracted from the raw data through the image denoising, fusion, and enhancement modules, which are then concatenated with a backbone network to obtain the segmentation results. By collaboratively training the preprocessing modules and backbone network with self-supervised loss functions, we strive to find out the optimal segmentation solution. Experimental results show that the proposed method can effectively improve the segmentation accuracy, while maintaining fast computation speed.</p>","PeriodicalId":101299,"journal":{"name":"Applied optics","volume":"64 27","pages":"8139-8150"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1364/AO.561465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Optical polarization imaging technology provides multi-dimensional light field information, encompassing spatial details and polarization data, which can be exploited for image semantic segmentation for target scene analysis. Most recent works focus on the development of neural networks with separate simple preprocessing steps to deal with the raw polarization images, which limit the accuracy of semantic segmentation. This paper proposes a novel, to the best of our knowledge, method, dubbed cascade deep polarization network (CDPN), to improve the performance of semantic segmentation by integrating preprocessing modules directly into the end-to-end deep learning work. The raw input data include the angle of linear polarization, degree of linear polarization, and a set of Stokes parameters. The multi-dimensional feature maps are extracted from the raw data through the image denoising, fusion, and enhancement modules, which are then concatenated with a backbone network to obtain the segmentation results. By collaboratively training the preprocessing modules and backbone network with self-supervised loss functions, we strive to find out the optimal segmentation solution. Experimental results show that the proposed method can effectively improve the segmentation accuracy, while maintaining fast computation speed.