{"title":"Selective attention mechanisms in a vision system based on neural networks","authors":"M. Rucci, P. Dario","doi":"10.1109/IROS.1993.583872","DOIUrl":null,"url":null,"abstract":"A system for visual recognition derived from a previously developed theoretical framework on the overall organization of the human visual system is proposed. The system operates dynamically by analyzing different parts of the input scene at variable levels of resolution through an attentional spotlight. A constant amount of information is gathered from the scene and a fixed dimension icon is produced, so that a trade-off occurs between the extension of the examined area and the level of resolution at which data are analyzed. The position of the spotlight and its dimensions are determined on the basis of the evolution of the recognition process. The icon is processed by a bottom-up path composed of a five-layer artificial neural network. The results of this net are analyzed by a planning module which determines if recognition has been achieved, or which action to undertake next. A top-down path, including a set of nets trained by the backpropagation algorithm, evaluates the parameters of the next sampling of information. The application of the system to object recognition with varying viewpoint and range from the camera is investigated.","PeriodicalId":299306,"journal":{"name":"Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.1993.583872","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A system for visual recognition derived from a previously developed theoretical framework on the overall organization of the human visual system is proposed. The system operates dynamically by analyzing different parts of the input scene at variable levels of resolution through an attentional spotlight. A constant amount of information is gathered from the scene and a fixed dimension icon is produced, so that a trade-off occurs between the extension of the examined area and the level of resolution at which data are analyzed. The position of the spotlight and its dimensions are determined on the basis of the evolution of the recognition process. The icon is processed by a bottom-up path composed of a five-layer artificial neural network. The results of this net are analyzed by a planning module which determines if recognition has been achieved, or which action to undertake next. A top-down path, including a set of nets trained by the backpropagation algorithm, evaluates the parameters of the next sampling of information. The application of the system to object recognition with varying viewpoint and range from the camera is investigated.