{"title":"应用高效编码原理理解多感官和多模态感官信号的编码","authors":"Li Zhaoping","doi":"10.1016/j.visres.2024.108489","DOIUrl":null,"url":null,"abstract":"<div><div>Sensory neurons often encode multisensory or multimodal signals. For example, many medial superior temporal (MST) neurons are tuned to heading direction of self-motion based on visual (optic flow) signals and vestibular signals. Middle temporal (MT) cortical neurons are tuned to object depth from signals of two visual modalities: motion parallax and binocular disparity. A MST neuron’s preferred heading directions from different senses can be congruent (matched) or opposite from each other. Similarly, the preferred depths of a MT neuron from the two modalities are congruent in some neurons and opposite in other neurons. While the congruent tuning appears natural for cue integration, the functions of the opposite tuning have been puzzling. This paper explains these tunings from the efficient coding principle that sensory encoding extracts as much sensory information as possible while minimizing neural cost. It extends the previous applications of this principle to understand neural receptive fields in retina and the primary visual cortex, particularly multimodal encoding of cone signals or binocular signals. Congruent and opposite sensory signals that excite the congruent and opposite neurons, respectively, are the decorrelated sensory components that provide a general purpose, efficient, representation of sensory inputs before task specific object segmentation and recognition. It can be extended to encoding signals from more than two sensory sources, e.g., from three cone types. This framework also predicts a wider tuning width for the opposite than congruent neurons, neurons that are neither congruent nor opposite, and how neural receptive fields adapt to statistical changes of sensory environments.</div></div>","PeriodicalId":23670,"journal":{"name":"Vision Research","volume":"226 ","pages":"Article 108489"},"PeriodicalIF":1.5000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Applying the efficient coding principle to understand encoding of multisensory and multimodality sensory signals\",\"authors\":\"Li Zhaoping\",\"doi\":\"10.1016/j.visres.2024.108489\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Sensory neurons often encode multisensory or multimodal signals. For example, many medial superior temporal (MST) neurons are tuned to heading direction of self-motion based on visual (optic flow) signals and vestibular signals. Middle temporal (MT) cortical neurons are tuned to object depth from signals of two visual modalities: motion parallax and binocular disparity. A MST neuron’s preferred heading directions from different senses can be congruent (matched) or opposite from each other. Similarly, the preferred depths of a MT neuron from the two modalities are congruent in some neurons and opposite in other neurons. While the congruent tuning appears natural for cue integration, the functions of the opposite tuning have been puzzling. This paper explains these tunings from the efficient coding principle that sensory encoding extracts as much sensory information as possible while minimizing neural cost. It extends the previous applications of this principle to understand neural receptive fields in retina and the primary visual cortex, particularly multimodal encoding of cone signals or binocular signals. Congruent and opposite sensory signals that excite the congruent and opposite neurons, respectively, are the decorrelated sensory components that provide a general purpose, efficient, representation of sensory inputs before task specific object segmentation and recognition. It can be extended to encoding signals from more than two sensory sources, e.g., from three cone types. This framework also predicts a wider tuning width for the opposite than congruent neurons, neurons that are neither congruent nor opposite, and how neural receptive fields adapt to statistical changes of sensory environments.</div></div>\",\"PeriodicalId\":23670,\"journal\":{\"name\":\"Vision Research\",\"volume\":\"226 \",\"pages\":\"Article 108489\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Vision Research\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0042698924001330\",\"RegionNum\":4,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vision Research","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0042698924001330","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
Applying the efficient coding principle to understand encoding of multisensory and multimodality sensory signals
Sensory neurons often encode multisensory or multimodal signals. For example, many medial superior temporal (MST) neurons are tuned to heading direction of self-motion based on visual (optic flow) signals and vestibular signals. Middle temporal (MT) cortical neurons are tuned to object depth from signals of two visual modalities: motion parallax and binocular disparity. A MST neuron’s preferred heading directions from different senses can be congruent (matched) or opposite from each other. Similarly, the preferred depths of a MT neuron from the two modalities are congruent in some neurons and opposite in other neurons. While the congruent tuning appears natural for cue integration, the functions of the opposite tuning have been puzzling. This paper explains these tunings from the efficient coding principle that sensory encoding extracts as much sensory information as possible while minimizing neural cost. It extends the previous applications of this principle to understand neural receptive fields in retina and the primary visual cortex, particularly multimodal encoding of cone signals or binocular signals. Congruent and opposite sensory signals that excite the congruent and opposite neurons, respectively, are the decorrelated sensory components that provide a general purpose, efficient, representation of sensory inputs before task specific object segmentation and recognition. It can be extended to encoding signals from more than two sensory sources, e.g., from three cone types. This framework also predicts a wider tuning width for the opposite than congruent neurons, neurons that are neither congruent nor opposite, and how neural receptive fields adapt to statistical changes of sensory environments.
期刊介绍:
Vision Research is a journal devoted to the functional aspects of human, vertebrate and invertebrate vision and publishes experimental and observational studies, reviews, and theoretical and computational analyses. Vision Research also publishes clinical studies relevant to normal visual function and basic research relevant to visual dysfunction or its clinical investigation. Functional aspects of vision is interpreted broadly, ranging from molecular and cellular function to perception and behavior. Detailed descriptions are encouraged but enough introductory background should be included for non-specialists. Theoretical and computational papers should give a sense of order to the facts or point to new verifiable observations. Papers dealing with questions in the history of vision science should stress the development of ideas in the field.