Yaxiong Xu, Bei Wu, Hui Jin, Chao Qian, Hongsheng Chen
{"title":"嵌入元表面编码器的多模态图像分割技术","authors":"Yaxiong Xu, Bei Wu, Hui Jin, Chao Qian, Hongsheng Chen","doi":"10.1002/andp.202400017","DOIUrl":null,"url":null,"abstract":"<p>Feature dimensionality plays an important role for modern medical diagnosis and image processing. In this work, the study introduces an optoelectronic neural network for multimodal image segmentation, which dramatically improves computing speed and decreases imaging acquisition cost in brain tumor diagnostics. Multi-layer metasurfaces are utilized as an image preprocessor that reduces image dimensionality at the physical layer. The low-dimensional image is then processed to a U-Net semantic segmentation network, to handle the complex and heterogeneous nature of brain image data. By using diffractive neural network, the metasurface encoder is optimized and physically constructed with high-efficiency transmission metasurfaces. The entire optoelectronic network attains a structural similarity index measure (SSIM) of 96%, demonstrating its potential to revolutionize on-site medical image processing with its high precision in segmenting brain imaging data.</p>","PeriodicalId":7896,"journal":{"name":"Annalen der Physik","volume":"536 8","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multimodality Image Segmentation Embedded with Metasurface Encoder\",\"authors\":\"Yaxiong Xu, Bei Wu, Hui Jin, Chao Qian, Hongsheng Chen\",\"doi\":\"10.1002/andp.202400017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Feature dimensionality plays an important role for modern medical diagnosis and image processing. In this work, the study introduces an optoelectronic neural network for multimodal image segmentation, which dramatically improves computing speed and decreases imaging acquisition cost in brain tumor diagnostics. Multi-layer metasurfaces are utilized as an image preprocessor that reduces image dimensionality at the physical layer. The low-dimensional image is then processed to a U-Net semantic segmentation network, to handle the complex and heterogeneous nature of brain image data. By using diffractive neural network, the metasurface encoder is optimized and physically constructed with high-efficiency transmission metasurfaces. The entire optoelectronic network attains a structural similarity index measure (SSIM) of 96%, demonstrating its potential to revolutionize on-site medical image processing with its high precision in segmenting brain imaging data.</p>\",\"PeriodicalId\":7896,\"journal\":{\"name\":\"Annalen der Physik\",\"volume\":\"536 8\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annalen der Physik\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/andp.202400017\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annalen der Physik","FirstCategoryId":"101","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/andp.202400017","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
Multimodality Image Segmentation Embedded with Metasurface Encoder
Feature dimensionality plays an important role for modern medical diagnosis and image processing. In this work, the study introduces an optoelectronic neural network for multimodal image segmentation, which dramatically improves computing speed and decreases imaging acquisition cost in brain tumor diagnostics. Multi-layer metasurfaces are utilized as an image preprocessor that reduces image dimensionality at the physical layer. The low-dimensional image is then processed to a U-Net semantic segmentation network, to handle the complex and heterogeneous nature of brain image data. By using diffractive neural network, the metasurface encoder is optimized and physically constructed with high-efficiency transmission metasurfaces. The entire optoelectronic network attains a structural similarity index measure (SSIM) of 96%, demonstrating its potential to revolutionize on-site medical image processing with its high precision in segmenting brain imaging data.
期刊介绍:
Annalen der Physik (AdP) is one of the world''s most renowned physics journals with an over 225 years'' tradition of excellence. Based on the fame of seminal papers by Einstein, Planck and many others, the journal is now tuned towards today''s most exciting findings including the annual Nobel Lectures. AdP comprises all areas of physics, with particular emphasis on important, significant and highly relevant results. Topics range from fundamental research to forefront applications including dynamic and interdisciplinary fields. The journal covers theory, simulation and experiment, e.g., but not exclusively, in condensed matter, quantum physics, photonics, materials physics, high energy, gravitation and astrophysics. It welcomes Rapid Research Letters, Original Papers, Review and Feature Articles.