{"title":"分层和模块化注意","authors":"H. Wechsler","doi":"10.1109/CAMP.1995.521044","DOIUrl":null,"url":null,"abstract":"The flow of visual input reaching the eye consists of huge amounts of time-varying information. It is crucial for both biological vision and automated systems to perceive and comprehend such a constantly changing environment within a relatively short processing time. To cope with such a computational challenge, one should locate and analyze only the information relevant to the current task by quickly focusing on selected areas of the scene as needed. Attention makes perception computationally tractable and helps with tasks such as object recognition. Attention permeates the whole stream of visual computation, it is both hierarchical and modular, and it involves representations, processing and strategies. Attentional mechanisms are intimately related to adaptation processes, and high-level attention corresponds to competitive, functional and learned behavioral programs. Attention consists of both data- and model-driven processes and their relationships, and it covers several levels such as sensory, reactive and behavioral processes. An example of how attention can be implemented considers time-varying imagery and it shows how functional linked pyramids and zoom lens operations lead to the generation of visual saccades. Both the time-varying imagery and the corresponding recognition memory are organized as pyramids and uniform indexing and classification interfaces using an attention pyramid are established. This paper concludes with a discussion on promising venues for future research that are most likely to enhance our understanding of attentional mechanisms.","PeriodicalId":277209,"journal":{"name":"Proceedings of Conference on Computer Architectures for Machine Perception","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Hierarchical and modular attention\",\"authors\":\"H. Wechsler\",\"doi\":\"10.1109/CAMP.1995.521044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The flow of visual input reaching the eye consists of huge amounts of time-varying information. It is crucial for both biological vision and automated systems to perceive and comprehend such a constantly changing environment within a relatively short processing time. To cope with such a computational challenge, one should locate and analyze only the information relevant to the current task by quickly focusing on selected areas of the scene as needed. Attention makes perception computationally tractable and helps with tasks such as object recognition. Attention permeates the whole stream of visual computation, it is both hierarchical and modular, and it involves representations, processing and strategies. Attentional mechanisms are intimately related to adaptation processes, and high-level attention corresponds to competitive, functional and learned behavioral programs. Attention consists of both data- and model-driven processes and their relationships, and it covers several levels such as sensory, reactive and behavioral processes. An example of how attention can be implemented considers time-varying imagery and it shows how functional linked pyramids and zoom lens operations lead to the generation of visual saccades. Both the time-varying imagery and the corresponding recognition memory are organized as pyramids and uniform indexing and classification interfaces using an attention pyramid are established. This paper concludes with a discussion on promising venues for future research that are most likely to enhance our understanding of attentional mechanisms.\",\"PeriodicalId\":277209,\"journal\":{\"name\":\"Proceedings of Conference on Computer Architectures for Machine Perception\",\"volume\":\"177 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of Conference on Computer Architectures for Machine Perception\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAMP.1995.521044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of Conference on Computer Architectures for Machine Perception","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAMP.1995.521044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The flow of visual input reaching the eye consists of huge amounts of time-varying information. It is crucial for both biological vision and automated systems to perceive and comprehend such a constantly changing environment within a relatively short processing time. To cope with such a computational challenge, one should locate and analyze only the information relevant to the current task by quickly focusing on selected areas of the scene as needed. Attention makes perception computationally tractable and helps with tasks such as object recognition. Attention permeates the whole stream of visual computation, it is both hierarchical and modular, and it involves representations, processing and strategies. Attentional mechanisms are intimately related to adaptation processes, and high-level attention corresponds to competitive, functional and learned behavioral programs. Attention consists of both data- and model-driven processes and their relationships, and it covers several levels such as sensory, reactive and behavioral processes. An example of how attention can be implemented considers time-varying imagery and it shows how functional linked pyramids and zoom lens operations lead to the generation of visual saccades. Both the time-varying imagery and the corresponding recognition memory are organized as pyramids and uniform indexing and classification interfaces using an attention pyramid are established. This paper concludes with a discussion on promising venues for future research that are most likely to enhance our understanding of attentional mechanisms.