{"title":"用于图像精确语义分割的级联深度极化网络。","authors":"Jinyu Zhang, Xu Ma, Weili Chen, Hantang Chen, Gonzalo R Arce","doi":"10.1364/AO.561465","DOIUrl":null,"url":null,"abstract":"<p><p>Optical polarization imaging technology provides multi-dimensional light field information, encompassing spatial details and polarization data, which can be exploited for image semantic segmentation for target scene analysis. Most recent works focus on the development of neural networks with separate simple preprocessing steps to deal with the raw polarization images, which limit the accuracy of semantic segmentation. This paper proposes a novel, to the best of our knowledge, method, dubbed cascade deep polarization network (CDPN), to improve the performance of semantic segmentation by integrating preprocessing modules directly into the end-to-end deep learning work. The raw input data include the angle of linear polarization, degree of linear polarization, and a set of Stokes parameters. The multi-dimensional feature maps are extracted from the raw data through the image denoising, fusion, and enhancement modules, which are then concatenated with a backbone network to obtain the segmentation results. By collaboratively training the preprocessing modules and backbone network with self-supervised loss functions, we strive to find out the optimal segmentation solution. Experimental results show that the proposed method can effectively improve the segmentation accuracy, while maintaining fast computation speed.</p>","PeriodicalId":101299,"journal":{"name":"Applied optics","volume":"64 27","pages":"8139-8150"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cascade deep polarization network for precise image semantic segmentation.\",\"authors\":\"Jinyu Zhang, Xu Ma, Weili Chen, Hantang Chen, Gonzalo R Arce\",\"doi\":\"10.1364/AO.561465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Optical polarization imaging technology provides multi-dimensional light field information, encompassing spatial details and polarization data, which can be exploited for image semantic segmentation for target scene analysis. Most recent works focus on the development of neural networks with separate simple preprocessing steps to deal with the raw polarization images, which limit the accuracy of semantic segmentation. This paper proposes a novel, to the best of our knowledge, method, dubbed cascade deep polarization network (CDPN), to improve the performance of semantic segmentation by integrating preprocessing modules directly into the end-to-end deep learning work. The raw input data include the angle of linear polarization, degree of linear polarization, and a set of Stokes parameters. The multi-dimensional feature maps are extracted from the raw data through the image denoising, fusion, and enhancement modules, which are then concatenated with a backbone network to obtain the segmentation results. By collaboratively training the preprocessing modules and backbone network with self-supervised loss functions, we strive to find out the optimal segmentation solution. Experimental results show that the proposed method can effectively improve the segmentation accuracy, while maintaining fast computation speed.</p>\",\"PeriodicalId\":101299,\"journal\":{\"name\":\"Applied optics\",\"volume\":\"64 27\",\"pages\":\"8139-8150\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied optics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1364/AO.561465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1364/AO.561465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cascade deep polarization network for precise image semantic segmentation.
Optical polarization imaging technology provides multi-dimensional light field information, encompassing spatial details and polarization data, which can be exploited for image semantic segmentation for target scene analysis. Most recent works focus on the development of neural networks with separate simple preprocessing steps to deal with the raw polarization images, which limit the accuracy of semantic segmentation. This paper proposes a novel, to the best of our knowledge, method, dubbed cascade deep polarization network (CDPN), to improve the performance of semantic segmentation by integrating preprocessing modules directly into the end-to-end deep learning work. The raw input data include the angle of linear polarization, degree of linear polarization, and a set of Stokes parameters. The multi-dimensional feature maps are extracted from the raw data through the image denoising, fusion, and enhancement modules, which are then concatenated with a backbone network to obtain the segmentation results. By collaboratively training the preprocessing modules and backbone network with self-supervised loss functions, we strive to find out the optimal segmentation solution. Experimental results show that the proposed method can effectively improve the segmentation accuracy, while maintaining fast computation speed.