深度学习网络与视觉感知

Oxford Research Encyclopedia of Psychology Pub Date : 2021-08-31 DOI:10.1093/acrefore/9780190236557.013.841

Grace W. Lindsay, Thomas Serre

{"title":"深度学习网络与视觉感知","authors":"Grace W. Lindsay, Thomas Serre","doi":"10.1093/acrefore/9780190236557.013.841","DOIUrl":null,"url":null,"abstract":"Deep learning is an approach to artificial intelligence (AI) centered on the training of deep artificial neural networks to perform complex tasks. Since the early 21st century, this approach has led to record-breaking advances in AI, allowing computers to solve complex board games, video games, natural language-processing tasks, and vision problems. Neuroscientists and psychologists have also utilized these networks as models of biological information processing to understand language, motor control, cognition, audition, and—most commonly—vision. Specifically, early feedforward network architectures were inspired by visual neuroscience and are used to model neural activity and human behavior. They also provide useful representations of the perceptual space of images. The extent to which these models match data, however, depends on the methods used to characterize and compare them. The limitations of these feedforward neural networks to account for, for example, simple visual reasoning tasks, suggests that feedback mechanisms may be necessary to solve visual recognition tasks beyond image categorization.","PeriodicalId":339030,"journal":{"name":"Oxford Research Encyclopedia of Psychology","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Deep Learning Networks and Visual Perception\",\"authors\":\"Grace W. Lindsay, Thomas Serre\",\"doi\":\"10.1093/acrefore/9780190236557.013.841\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning is an approach to artificial intelligence (AI) centered on the training of deep artificial neural networks to perform complex tasks. Since the early 21st century, this approach has led to record-breaking advances in AI, allowing computers to solve complex board games, video games, natural language-processing tasks, and vision problems. Neuroscientists and psychologists have also utilized these networks as models of biological information processing to understand language, motor control, cognition, audition, and—most commonly—vision. Specifically, early feedforward network architectures were inspired by visual neuroscience and are used to model neural activity and human behavior. They also provide useful representations of the perceptual space of images. The extent to which these models match data, however, depends on the methods used to characterize and compare them. The limitations of these feedforward neural networks to account for, for example, simple visual reasoning tasks, suggests that feedback mechanisms may be necessary to solve visual recognition tasks beyond image categorization.\",\"PeriodicalId\":339030,\"journal\":{\"name\":\"Oxford Research Encyclopedia of Psychology\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Oxford Research Encyclopedia of Psychology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/acrefore/9780190236557.013.841\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oxford Research Encyclopedia of Psychology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/acrefore/9780190236557.013.841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

深度学习是一种以训练深度人工神经网络来执行复杂任务为中心的人工智能(AI)方法。自21世纪初以来，这种方法导致人工智能取得了破纪录的进步，使计算机能够解决复杂的棋盘游戏、视频游戏、自然语言处理任务和视觉问题。神经科学家和心理学家也利用这些网络作为生物信息处理的模型来理解语言、运动控制、认知、听觉，以及最常见的视觉。具体来说，早期的前馈网络架构受到视觉神经科学的启发，并用于模拟神经活动和人类行为。它们还提供了图像感知空间的有用表示。然而，这些模型与数据匹配的程度取决于用来描述和比较它们的方法。这些前馈神经网络的局限性，例如，简单的视觉推理任务，表明反馈机制可能是解决图像分类以外的视觉识别任务所必需的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep Learning Networks and Visual Perception

Deep learning is an approach to artificial intelligence (AI) centered on the training of deep artificial neural networks to perform complex tasks. Since the early 21st century, this approach has led to record-breaking advances in AI, allowing computers to solve complex board games, video games, natural language-processing tasks, and vision problems. Neuroscientists and psychologists have also utilized these networks as models of biological information processing to understand language, motor control, cognition, audition, and—most commonly—vision. Specifically, early feedforward network architectures were inspired by visual neuroscience and are used to model neural activity and human behavior. They also provide useful representations of the perceptual space of images. The extent to which these models match data, however, depends on the methods used to characterize and compare them. The limitations of these feedforward neural networks to account for, for example, simple visual reasoning tasks, suggests that feedback mechanisms may be necessary to solve visual recognition tasks beyond image categorization.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Oxford Research Encyclopedia of Psychology

自引率

0.00%

发文量