{"title":"用于相机数据采集、控制和图像估计的深度学习","authors":"D. Brady, Lu Fang, Zhan Ma","doi":"10.1364/AOP.398263","DOIUrl":null,"url":null,"abstract":"We review the impact of deep-learning technologies on camera architecture. The function of a camera is first to capture visual information and second to form an image. Conventionally, both functions are implemented in physical optics. Throughout the digital age, however, joint design of physical sampling and electronic processing, e.g., computational imaging, has been increasingly applied to improve these functions. Over the past five years, deep learning has radically improved the capacity of computational imaging. Here we briefly review the development of artificial neural networks and their recent intersection with computational imaging. We then consider in more detail how deep learning impacts the primary strategies of computational photography: focal plane modulation, lens design, and robotic control. With focal plane modulation, we show that deep learning improves signal inference to enable faster hyperspectral, polarization, and video capture while reducing the power per pixel by 10−100×. With lens design, deep learning improves multiple aperture image fusion to enable task-specific array cameras. With control, deep learning enables dynamic scene-specific control that may ultimately enable cameras that capture the entire optical data cube (the “light field”), rather than just a focal slice. Finally, we discuss how these three strategies impact the physical camera design as we seek to balance physical compactness and simplicity, information capacity, computational complexity, and visual fidelity.","PeriodicalId":48960,"journal":{"name":"Advances in Optics and Photonics","volume":"12 1","pages":"787-846"},"PeriodicalIF":25.2000,"publicationDate":"2020-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Deep learning for camera data acquisition, control, and image estimation\",\"authors\":\"D. Brady, Lu Fang, Zhan Ma\",\"doi\":\"10.1364/AOP.398263\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We review the impact of deep-learning technologies on camera architecture. The function of a camera is first to capture visual information and second to form an image. Conventionally, both functions are implemented in physical optics. Throughout the digital age, however, joint design of physical sampling and electronic processing, e.g., computational imaging, has been increasingly applied to improve these functions. Over the past five years, deep learning has radically improved the capacity of computational imaging. Here we briefly review the development of artificial neural networks and their recent intersection with computational imaging. We then consider in more detail how deep learning impacts the primary strategies of computational photography: focal plane modulation, lens design, and robotic control. With focal plane modulation, we show that deep learning improves signal inference to enable faster hyperspectral, polarization, and video capture while reducing the power per pixel by 10−100×. With lens design, deep learning improves multiple aperture image fusion to enable task-specific array cameras. With control, deep learning enables dynamic scene-specific control that may ultimately enable cameras that capture the entire optical data cube (the “light field”), rather than just a focal slice. Finally, we discuss how these three strategies impact the physical camera design as we seek to balance physical compactness and simplicity, information capacity, computational complexity, and visual fidelity.\",\"PeriodicalId\":48960,\"journal\":{\"name\":\"Advances in Optics and Photonics\",\"volume\":\"12 1\",\"pages\":\"787-846\"},\"PeriodicalIF\":25.2000,\"publicationDate\":\"2020-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Optics and Photonics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1364/AOP.398263\",\"RegionNum\":1,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Optics and Photonics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1364/AOP.398263","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
Deep learning for camera data acquisition, control, and image estimation
We review the impact of deep-learning technologies on camera architecture. The function of a camera is first to capture visual information and second to form an image. Conventionally, both functions are implemented in physical optics. Throughout the digital age, however, joint design of physical sampling and electronic processing, e.g., computational imaging, has been increasingly applied to improve these functions. Over the past five years, deep learning has radically improved the capacity of computational imaging. Here we briefly review the development of artificial neural networks and their recent intersection with computational imaging. We then consider in more detail how deep learning impacts the primary strategies of computational photography: focal plane modulation, lens design, and robotic control. With focal plane modulation, we show that deep learning improves signal inference to enable faster hyperspectral, polarization, and video capture while reducing the power per pixel by 10−100×. With lens design, deep learning improves multiple aperture image fusion to enable task-specific array cameras. With control, deep learning enables dynamic scene-specific control that may ultimately enable cameras that capture the entire optical data cube (the “light field”), rather than just a focal slice. Finally, we discuss how these three strategies impact the physical camera design as we seek to balance physical compactness and simplicity, information capacity, computational complexity, and visual fidelity.
期刊介绍:
Advances in Optics and Photonics (AOP) is an all-electronic journal that publishes comprehensive review articles and multimedia tutorials. It is suitable for students, researchers, faculty, business professionals, and engineers interested in optics and photonics. The content of the journal covers advancements in these fields, ranging from fundamental science to engineering applications.
The journal aims to capture the most significant developments in optics and photonics. It achieves this through long review articles and comprehensive tutorials written by prominent and respected authors who are at the forefront of their fields.
The journal goes beyond traditional text-based articles by enhancing the content with multimedia elements, such as animation and video. This multimedia approach helps to enhance the understanding and visualization of complex concepts.
AOP offers dedicated article preparation and peer-review support to assist authors throughout the publication process. This support ensures that the articles meet the journal's standards and are well-received by readers.
Additionally, AOP welcomes comments on published review articles, encouraging further discussions and insights from the scientific community.
In summary, Advances in Optics and Photonics is a comprehensive journal that provides authoritative and accessible content on advancements in optics and photonics. With its diverse range of articles, multimedia enhancements, and dedicated support, AOP serves as a valuable resource for professionals and researchers in these fields.