{"title":"Navigating image space","authors":"Andrew Glennerster","doi":"10.1016/j.neuropsychologia.2025.109233","DOIUrl":null,"url":null,"abstract":"<div><div>Navigation means getting from here to there. Unfortunately, for biological navigation, there is no agreed definition of what we might mean by ‘here’ or ‘there’. Computer vision (‘Simultaneous Localisation and Mapping’, SLAM) uses a 3D world-based coordinate frame but that is a poor model for biological spatial representation. Another possibility is to use an image-based rather than a map-based representation. The image-based strategy is made simpler if the observer maintains fixation on a stationary point in the scene as they move. This strategy would require a system for relating different fixation points to one another as the observer moves through the environment. I describe how this can be done by, first, relating fixations to an egocentric representation of visual direction and, second, encoding egocentric representations in a coarse-to-fine hierarchy. The coarsest level of this hierarchy is, in some sense, a world-based frame as it does not vary with eye rotation or observer translation. This representation could be implemented as a ‘policy’, a term used in reinforcement learning to describe a set of states and associated actions, or a ‘graph’ that describes how images or sensory states can be connected by actions. I discuss some of the psychophysical evidence relating to these differing hypotheses about spatial representation and navigation. I argue that this evidence supports image-based rather than map-based representation.</div></div>","PeriodicalId":19279,"journal":{"name":"Neuropsychologia","volume":"219 ","pages":"Article 109233"},"PeriodicalIF":2.0000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuropsychologia","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S002839322500168X","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Navigation means getting from here to there. Unfortunately, for biological navigation, there is no agreed definition of what we might mean by ‘here’ or ‘there’. Computer vision (‘Simultaneous Localisation and Mapping’, SLAM) uses a 3D world-based coordinate frame but that is a poor model for biological spatial representation. Another possibility is to use an image-based rather than a map-based representation. The image-based strategy is made simpler if the observer maintains fixation on a stationary point in the scene as they move. This strategy would require a system for relating different fixation points to one another as the observer moves through the environment. I describe how this can be done by, first, relating fixations to an egocentric representation of visual direction and, second, encoding egocentric representations in a coarse-to-fine hierarchy. The coarsest level of this hierarchy is, in some sense, a world-based frame as it does not vary with eye rotation or observer translation. This representation could be implemented as a ‘policy’, a term used in reinforcement learning to describe a set of states and associated actions, or a ‘graph’ that describes how images or sensory states can be connected by actions. I discuss some of the psychophysical evidence relating to these differing hypotheses about spatial representation and navigation. I argue that this evidence supports image-based rather than map-based representation.
期刊介绍:
Neuropsychologia is an international interdisciplinary journal devoted to experimental and theoretical contributions that advance understanding of human cognition and behavior from a neuroscience perspective. The journal will consider for publication studies that link brain function with cognitive processes, including attention and awareness, action and motor control, executive functions and cognitive control, memory, language, and emotion and social cognition.