{"title":"Using causal scene analysis to direct focus of attention","authors":"L. Birnbaum, M. Brand, P. Cooper","doi":"10.1109/WQV.1993.262953","DOIUrl":null,"url":null,"abstract":"Vision should provide an explanation of the scene in terms of a causal semantics-what affects what, and why. For mobile agents, the structural integrity of the immediate environment is a major concern. Thus, an important part of the causal explanation of static scenes is what supports what, or, counterfactually: Why aren't things moving? The authors use simple naive physical knowledge as the basis of a vertically integrated vision system that explains arbitrarily complex stacked block structures. The semantics provides a basis for controlling the application of visual attention, and forms a framework for the explanation that is generated. They show how the program sequentially explores scenes of complex blocks structures, identifies functional substructures such as arches and cantilevers, and develops an explanation of why the whole construction stands and the role of each block in its stability.<<ETX>>","PeriodicalId":309941,"journal":{"name":"[1993] Proceedings IEEE Workshop on Qualitative Vision","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1993] Proceedings IEEE Workshop on Qualitative Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WQV.1993.262953","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Vision should provide an explanation of the scene in terms of a causal semantics-what affects what, and why. For mobile agents, the structural integrity of the immediate environment is a major concern. Thus, an important part of the causal explanation of static scenes is what supports what, or, counterfactually: Why aren't things moving? The authors use simple naive physical knowledge as the basis of a vertically integrated vision system that explains arbitrarily complex stacked block structures. The semantics provides a basis for controlling the application of visual attention, and forms a framework for the explanation that is generated. They show how the program sequentially explores scenes of complex blocks structures, identifies functional substructures such as arches and cantilevers, and develops an explanation of why the whole construction stands and the role of each block in its stability.<>