Ruiqi Liang, Shuai Wang, Yiying Dong, Liu Li, Ying Kuang, Bohan Zhang and Yuanmu Yang*,
{"title":"Metasurface-Generated Large and Arbitrary Analog Convolution Kernels for Accelerated Machine Vision","authors":"Ruiqi Liang, Shuai Wang, Yiying Dong, Liu Li, Ying Kuang, Bohan Zhang and Yuanmu Yang*, ","doi":"10.1021/acsphotonics.4c0187410.1021/acsphotonics.4c01874","DOIUrl":null,"url":null,"abstract":"<p >In the rapidly evolving field of artificial intelligence, convolutional neural networks are essential for tackling complex challenges, such as machine vision and medical diagnosis. Recently, to address the challenges in processing speed and power consumption of conventional digital convolution operations, many optical components have been suggested to replace the digital convolution layer in the neural network, accelerating various machine vision tasks. Nonetheless, the analogous nature of the optical convolution kernel has not been fully explored. Here, we develop a spatial frequency domain training method to create arbitrarily shaped analog convolution kernels using an optical metasurface as the convolution layer, with its receptive field largely surpassing digital convolution kernels. By employing spatial multiplexing, the multiple parallel convolution kernels with both positive and negative weights are generated under the incoherent illumination condition. We experimentally demonstrate a 98.59% classification accuracy on the MNIST data set, with simulations showing 92.63% and 68.67% accuracy on the Fashion-MNIST and CIFAR-10 data sets with additional digital layers. This work underscores the unique advantage of analogue optical convolution, offering a promising avenue to accelerate machine vision tasks, especially in edge devices.</p>","PeriodicalId":23,"journal":{"name":"ACS Photonics","volume":"11 12","pages":"5430–5438 5430–5438"},"PeriodicalIF":6.7000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Photonics","FirstCategoryId":"101","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsphotonics.4c01874","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In the rapidly evolving field of artificial intelligence, convolutional neural networks are essential for tackling complex challenges, such as machine vision and medical diagnosis. Recently, to address the challenges in processing speed and power consumption of conventional digital convolution operations, many optical components have been suggested to replace the digital convolution layer in the neural network, accelerating various machine vision tasks. Nonetheless, the analogous nature of the optical convolution kernel has not been fully explored. Here, we develop a spatial frequency domain training method to create arbitrarily shaped analog convolution kernels using an optical metasurface as the convolution layer, with its receptive field largely surpassing digital convolution kernels. By employing spatial multiplexing, the multiple parallel convolution kernels with both positive and negative weights are generated under the incoherent illumination condition. We experimentally demonstrate a 98.59% classification accuracy on the MNIST data set, with simulations showing 92.63% and 68.67% accuracy on the Fashion-MNIST and CIFAR-10 data sets with additional digital layers. This work underscores the unique advantage of analogue optical convolution, offering a promising avenue to accelerate machine vision tasks, especially in edge devices.
期刊介绍:
Published as soon as accepted and summarized in monthly issues, ACS Photonics will publish Research Articles, Letters, Perspectives, and Reviews, to encompass the full scope of published research in this field.