用于加速机器视觉的多通道元模拟器

IF 38.1 1区 材料科学 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY
Hanyu Zheng, Quan Liu, Ivan I. Kravchenko, Xiaomeng Zhang, Yuankai Huo, Jason G. Valentine
{"title":"用于加速机器视觉的多通道元模拟器","authors":"Hanyu Zheng, Quan Liu, Ivan I. Kravchenko, Xiaomeng Zhang, Yuankai Huo, Jason G. Valentine","doi":"10.1038/s41565-023-01557-2","DOIUrl":null,"url":null,"abstract":"Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.","PeriodicalId":18915,"journal":{"name":"Nature nanotechnology","volume":"19 4","pages":"471-478"},"PeriodicalIF":38.1000,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multichannel meta-imagers for accelerating machine vision\",\"authors\":\"Hanyu Zheng, Quan Liu, Ivan I. Kravchenko, Xiaomeng Zhang, Yuankai Huo, Jason G. Valentine\",\"doi\":\"10.1038/s41565-023-01557-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.\",\"PeriodicalId\":18915,\"journal\":{\"name\":\"Nature nanotechnology\",\"volume\":\"19 4\",\"pages\":\"471-478\"},\"PeriodicalIF\":38.1000,\"publicationDate\":\"2024-01-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature nanotechnology\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://www.nature.com/articles/s41565-023-01557-2\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature nanotechnology","FirstCategoryId":"88","ListUrlMain":"https://www.nature.com/articles/s41565-023-01557-2","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

机器视觉技术的飞速发展影响了医疗设备和自动驾驶系统等各种应用。然而,这些成就通常需要使用数字神经网络,其缺点是需要大量计算资源,因此能耗较高。因此,当计算资源不易获取时,实时决策就会受到阻碍。在此,我们报告了一种元成像仪,其设计目的是与数字后端协同工作,将计算成本高昂的卷积操作卸载到高速、低功耗的光学器件中。在这一架构中,元表面实现了角度和偏振多路复用,从而创建了多个信息通道,可在单次拍摄中执行正值和负值卷积操作。我们使用元成像仪进行物体分类,手写数字的准确率达到 98.6%,时尚图像的准确率达到 88.8%。由于其结构紧凑、速度快、功耗低,我们的方法可以在人工智能和机器视觉应用中找到广泛的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Multichannel meta-imagers for accelerating machine vision

Multichannel meta-imagers for accelerating machine vision

Multichannel meta-imagers for accelerating machine vision
Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Nature nanotechnology
Nature nanotechnology 工程技术-材料科学:综合
CiteScore
59.70
自引率
0.80%
发文量
196
审稿时长
4-8 weeks
期刊介绍: Nature Nanotechnology is a prestigious journal that publishes high-quality papers in various areas of nanoscience and nanotechnology. The journal focuses on the design, characterization, and production of structures, devices, and systems that manipulate and control materials at atomic, molecular, and macromolecular scales. It encompasses both bottom-up and top-down approaches, as well as their combinations. Furthermore, Nature Nanotechnology fosters the exchange of ideas among researchers from diverse disciplines such as chemistry, physics, material science, biomedical research, engineering, and more. It promotes collaboration at the forefront of this multidisciplinary field. The journal covers a wide range of topics, from fundamental research in physics, chemistry, and biology, including computational work and simulations, to the development of innovative devices and technologies for various industrial sectors such as information technology, medicine, manufacturing, high-performance materials, energy, and environmental technologies. It includes coverage of organic, inorganic, and hybrid materials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信