{"title":"实现人类水平的视觉系统","authors":"JianHao Ding, TieJun Huang","doi":"10.1007/s11431-024-2762-5","DOIUrl":null,"url":null,"abstract":"<p>The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.</p>","PeriodicalId":21612,"journal":{"name":"Science China Technological Sciences","volume":"49 1","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards human-leveled vision systems\",\"authors\":\"JianHao Ding, TieJun Huang\",\"doi\":\"10.1007/s11431-024-2762-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.</p>\",\"PeriodicalId\":21612,\"journal\":{\"name\":\"Science China Technological Sciences\",\"volume\":\"49 1\",\"pages\":\"\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science China Technological Sciences\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11431-024-2762-5\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science China Technological Sciences","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11431-024-2762-5","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
The human visual system is a complex and interconnected network comprising billions of neurons. It plays an essential role in translating environmental light stimuli into information that guides and shapes human perception and action. Research on the visual system aims to uncover the underlying neural structure principles of human visual perception and their possible applications. Currently, there are two main approaches: biological system analysis and simulation, artificial intelligence models based on deep learning. Here we aim to discuss the two approaches to human-level vision systems. Deep learning has significantly impacted the field of vision with achievements in representation, modeling, and hardware design. However, there is still a significant gap between deep learning models and the human visual system in terms of scalability, transferability, and sustainability. The progress of the biological visual system can help fill the gap by further understanding the properties and functions of different components of the system. We take the efforts of reconstructing the retina as an example to illustrate that even if we are unable to replicate the visual system on a computer right now, we can still learn a lot by combining existing research outcomes in neuroscience. At the end of the paper, we suggest tracing back to gradually build visual systems from the computational counterpart of biological structures to achieve a human-level vision system in the future.
期刊介绍:
Science China Technological Sciences, an academic journal cosponsored by the Chinese Academy of Sciences and the National Natural Science Foundation of China, and published by Science China Press, is committed to publishing high-quality, original results in both basic and applied research.
Science China Technological Sciences is published in both print and electronic forms. It is indexed by Science Citation Index.
Categories of articles:
Reviews summarize representative results and achievements in a particular topic or an area, comment on the current state of research, and advise on the research directions. The author’s own opinion and related discussion is requested.
Research papers report on important original results in all areas of technological sciences.
Brief reports present short reports in a timely manner of the latest important results.