On the importance of severely testing deep learning models of cognition

IF 2.1 3区心理学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Cognitive Systems Research Pub Date : 2023-08-22 DOI:10.1016/j.cogsys.2023.101158

Jeffrey S. Bowers , Gaurav Malhotra , Federico Adolfi , Marin Dujmović , Milton L. Montero , Valerio Biscione , Guillermo Puebla , John H. Hummel , Rachel F. Heaton

{"title":"On the importance of severely testing deep learning models of cognition","authors":"Jeffrey S. Bowers , Gaurav Malhotra , Federico Adolfi , Marin Dujmović , Milton L. Montero , Valerio Biscione , Guillermo Puebla , John H. Hummel , Rachel F. Heaton","doi":"10.1016/j.cogsys.2023.101158","DOIUrl":null,"url":null,"abstract":"<div><p>Researchers studying the correspondences between Deep Neural Networks (DNNs) and humans often give little consideration to severe testing when drawing conclusions from empirical findings, and this is impeding progress in building better models of minds. We first detail what we mean by severe testing and highlight how this is especially important when working with opaque models with many free parameters that may solve a given task in multiple different ways. Second, we provide multiple examples of researchers making strong claims regarding DNN-human similarities without engaging in severe testing of their hypotheses. Third, we consider why severe testing is undervalued. We provide evidence that part of the fault lies with the review process. There is now a widespread appreciation in many areas of science that a bias for publishing positive results (among other practices) is leading to a credibility crisis, but there seems less awareness of the problem here.</p></div>","PeriodicalId":55242,"journal":{"name":"Cognitive Systems Research","volume":"82 ","pages":"Article 101158"},"PeriodicalIF":2.1000,"publicationDate":"2023-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Systems Research","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S138904172300092X","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

Researchers studying the correspondences between Deep Neural Networks (DNNs) and humans often give little consideration to severe testing when drawing conclusions from empirical findings, and this is impeding progress in building better models of minds. We first detail what we mean by severe testing and highlight how this is especially important when working with opaque models with many free parameters that may solve a given task in multiple different ways. Second, we provide multiple examples of researchers making strong claims regarding DNN-human similarities without engaging in severe testing of their hypotheses. Third, we consider why severe testing is undervalued. We provide evidence that part of the fault lies with the review process. There is now a widespread appreciation in many areas of science that a bias for publishing positive results (among other practices) is leading to a credibility crisis, but there seems less awareness of the problem here.

查看原文本刊更多论文

关于严格测试深度学习认知模型的重要性

研究深度神经网络(dnn)与人类之间对应关系的研究人员在从经验发现中得出结论时，往往很少考虑严格的测试，这阻碍了建立更好的思维模型的进展。我们首先详细说明严格测试的含义，并强调在使用具有许多自由参数的不透明模型时，这一点特别重要，这些模型可能以多种不同的方式解决给定的任务。其次，我们提供了多个研究人员在没有对他们的假设进行严格测试的情况下就dnn -人类相似性提出强烈主张的例子。第三，我们考虑为什么严格的测试被低估了。我们提供的证据表明，部分错误在于审查过程。现在，在许多科学领域，人们普遍认识到，发表积极结果的偏见(以及其他做法)正在导致可信度危机，但在这里，人们似乎对这个问题的认识较少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cognitive Systems Research 工程技术-计算机：人工智能

CiteScore

9.40

自引率

5.10%

发文量

审稿时长

>12 weeks

期刊介绍： Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial. The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition. Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.