Deep Learning Networks and Visual Perception

Grace W. Lindsay, Thomas Serre
{"title":"Deep Learning Networks and Visual Perception","authors":"Grace W. Lindsay, Thomas Serre","doi":"10.1093/acrefore/9780190236557.013.841","DOIUrl":null,"url":null,"abstract":"Deep learning is an approach to artificial intelligence (AI) centered on the training of deep artificial neural networks to perform complex tasks. Since the early 21st century, this approach has led to record-breaking advances in AI, allowing computers to solve complex board games, video games, natural language-processing tasks, and vision problems. Neuroscientists and psychologists have also utilized these networks as models of biological information processing to understand language, motor control, cognition, audition, and—most commonly—vision. Specifically, early feedforward network architectures were inspired by visual neuroscience and are used to model neural activity and human behavior. They also provide useful representations of the perceptual space of images. The extent to which these models match data, however, depends on the methods used to characterize and compare them. The limitations of these feedforward neural networks to account for, for example, simple visual reasoning tasks, suggests that feedback mechanisms may be necessary to solve visual recognition tasks beyond image categorization.","PeriodicalId":339030,"journal":{"name":"Oxford Research Encyclopedia of Psychology","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oxford Research Encyclopedia of Psychology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/acrefore/9780190236557.013.841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Deep learning is an approach to artificial intelligence (AI) centered on the training of deep artificial neural networks to perform complex tasks. Since the early 21st century, this approach has led to record-breaking advances in AI, allowing computers to solve complex board games, video games, natural language-processing tasks, and vision problems. Neuroscientists and psychologists have also utilized these networks as models of biological information processing to understand language, motor control, cognition, audition, and—most commonly—vision. Specifically, early feedforward network architectures were inspired by visual neuroscience and are used to model neural activity and human behavior. They also provide useful representations of the perceptual space of images. The extent to which these models match data, however, depends on the methods used to characterize and compare them. The limitations of these feedforward neural networks to account for, for example, simple visual reasoning tasks, suggests that feedback mechanisms may be necessary to solve visual recognition tasks beyond image categorization.
深度学习网络与视觉感知
深度学习是一种以训练深度人工神经网络来执行复杂任务为中心的人工智能(AI)方法。自21世纪初以来,这种方法导致人工智能取得了破纪录的进步,使计算机能够解决复杂的棋盘游戏、视频游戏、自然语言处理任务和视觉问题。神经科学家和心理学家也利用这些网络作为生物信息处理的模型来理解语言、运动控制、认知、听觉,以及最常见的视觉。具体来说,早期的前馈网络架构受到视觉神经科学的启发,并用于模拟神经活动和人类行为。它们还提供了图像感知空间的有用表示。然而,这些模型与数据匹配的程度取决于用来描述和比较它们的方法。这些前馈神经网络的局限性,例如,简单的视觉推理任务,表明反馈机制可能是解决图像分类以外的视觉识别任务所必需的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信