{"title":"Robust satellite image analysis using probabilistic learning based graph optimization","authors":"Yangyu Tao, Lin Liang, Ying-Qing Xu","doi":"10.1109/WIAMIS.2009.5031452","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031452","url":null,"abstract":"We study the satellite image analysis problem with focus on extracting the man-made buildings. Instead of assuming simple rectangular building shape as in the most of previous work, we apply probabilistic learning method to statistical modeling the building structures. The model can achieve high robustness to large shape variation. We also propose a novel energy function to incorporate the statistical model into a graph optimization framework. Once the graph is constructed on image edges, the buildings can be extracted as closed cycles on graph efficiently and accurately. Experiments on real images demonstrate the effectiveness and robustness of the approach.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127522604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Background modeling for detecting move-then-stop arbitrary-long time video objects","authors":"Xiaodong Cai, F. Ali, E. Stipidis","doi":"10.1109/WIAMIS.2009.5031467","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031467","url":null,"abstract":"Obtaining a dynamically updated background reference image is an important and challenging task for video applications using background subtraction. This paper proposes a novel algorithm for dynamic video background reconstruction with move-then-stop arbitrary-long time video object detection enabled.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127555483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Deligiannis, A. Munteanu, T. Clerckx, J. Cornelis, P. Schelkens
{"title":"Correlation channel estimation in pixel-domain distributed video coding","authors":"N. Deligiannis, A. Munteanu, T. Clerckx, J. Cornelis, P. Schelkens","doi":"10.1109/WIAMIS.2009.5031440","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031440","url":null,"abstract":"The paper addresses an essential problem in distributed video coding (DVC) which is modeling and estimation of the correlation channel. Current works assume the distribution of the correlation noise to be independent from the realization of the side-information. The paper demonstrates that this assumption is inaccurate and proposes a novel model which depends on the realization of the side-information. The proposed model is experimentally validated showing remarkable accuracy over conventional models. Driven by the side-information-dependency of the proposed model, a novel technique for estimating the correlation channel in video at the decoder is introduced. The proposed technique is incorporated into a unidirectional spatial-domain DVC system achieving similar performance compared to the state-of-the-art transform-domain DVC at a much lower encoding complexity.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117061969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reuse of video annotations based on low-level descriptor similarity","authors":"M. Cordeiro, Cristina Ribeiro","doi":"10.1109/WIAMIS.2009.5031466","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031466","url":null,"abstract":"The paper proposes a mixed annotation approach that exploits the advantages of both automatic and manual annotation techniques. Annotated multimedia material is regarded as a source of low- to high-level feature mappings supporting the propagation of annotations to new multimedia material. Video analysis tools do not currently produce effective annotations for retrieval, while manual annotation is expensive. The proposed approach uses low-level feature similarity to guide the retrieval of keyword annotations and aims to preserve the high quality of manual annotations while reducing the time and cost per annotated video unit. The annotation tool assists users, suggesting keywords for an item that come from similar items according to low-level descriptors. The effectiveness of current descriptors has been evaluated in an experimental environment using 5 video collections and a set of MPEG-7 descriptors. The similarity results have been compared to manually evaluated similarity.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134373840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic multimedia annotation through kernel combinations","authors":"D. D. Cao, Roberto Basili, R. Petitti","doi":"10.1109/WIAMIS.2009.5031434","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031434","url":null,"abstract":"An image classification model is here presented based on the integration of visual and textual properties supported by complex kernel functions. Linguistic descriptions derived through Information Extraction from Web pages are here integrated with the visual features corresponding to the images, according to independent kernel combinations. The impact of dimensionality reduction methods (i.e. LSA) and of proper combinations of redundant feature descriptions is also presented. The resulting workflow is largely applicable as the comparative evaluation discussed here confirms.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"38 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133479520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Chrysochos, E. E. Varsaki, V. Fotopoulos, A. Skodras
{"title":"High capacity reversible data hiding using overlapping difference expansion","authors":"E. Chrysochos, E. E. Varsaki, V. Fotopoulos, A. Skodras","doi":"10.1109/WIAMIS.2009.5031447","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031447","url":null,"abstract":"Difference expansion (DE) has been widely used for reversible data hiding. In this work, a new DE based scheme is presented that uses consecutive, overlapping pairs, instead of the non-overlapping pairs or triads used by traditional DE derivatives. The scheme is superior to the existing approaches, both in capacity and PSNR terms. By applying multiple runs of the embedding process, a significant capacity gain is obtained at the expense of lower quality.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128458250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Posture annotation for studying affective interaction in multimodal corpora","authors":"Jean-Claude Martin","doi":"10.1109/WIAMIS.2009.5031483","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031483","url":null,"abstract":"Although the relations between affects and posture were the focus of several studies in Psychology, their consideration in multimodal corpora research remains scarce. In this paper we survey related studies and propose a framework for studying affective postures by integrating knowledge coming from multiple sources such as literature, video corpora, motion capture data, artistic design, and perception studies. We believe that such a multi-source framework can be useful for integrating posture annotation in multimodal corpora research.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"95 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128884910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Context-aware graph-based content representation for semantic navigation in multimedia news archives","authors":"M. Montagnuolo, A. Messina, Marco Ferri","doi":"10.1109/WIAMIS.2009.5031433","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031433","url":null,"abstract":"Representing the content relations in multimedia data plays a crucial role for the delivery of information navigation services. However, current tools for information extraction and visualisation are still far to be satisfactory. This paper addresses this task providing an unsupervised framework for graph-based multimedia news content representation. The system uses hybrid clustering and a graph partitioning technique to aggregate semantically related multimedia news from the Web and TV. These aggregations are represented by oriented graphs at increasing level of abstraction. Such an information can be accessed and browsed through a Web interface.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"178 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133255224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Timothy R. Brick, Jeffrey R. Spies, B. Theobald, I. Matthews, S. Boker
{"title":"High-presence, low-bandwidth, apparent 3D video-conferencing with a single camera","authors":"Timothy R. Brick, Jeffrey R. Spies, B. Theobald, I. Matthews, S. Boker","doi":"10.1109/WIAMIS.2009.5031494","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031494","url":null,"abstract":"Small digital video cameras have become increasingly common, appearing on portable consumer devices such as cellular phones. The widespread use of video-conferencing, however, is limited in part by the lack of bandwidth available on such devices. Also, video-conferencing can produce feelings of discomfort in conversants due to a lack of co-presence. Current techniques to increase co-presence are not practical in the consumer market due to the costly and elaborate equipment required (such as stereoscopic displays and multicamera arrays).","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129709734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christopher E. Peters, S. Asteriadis, G. Rebolledo-Méndez
{"title":"Modelling user attention for human-agent interaction","authors":"Christopher E. Peters, S. Asteriadis, G. Rebolledo-Méndez","doi":"10.1109/WIAMIS.2009.5031484","DOIUrl":"https://doi.org/10.1109/WIAMIS.2009.5031484","url":null,"abstract":"In this work, we propose a design for a user attention model featuring three core components. Our system components can work in real-time, offering indications of user attention from different sensory inputs (both visual and neurophysiological). The intention of the current work is to keep the equipment as unintrusive as possible, while keeping the confidence of the inputs as high as possible. We discuss potential applications of such a system, particularly with respect to evaluating user attentive behaviour during human-agent interactions and as a more natural interface for interacting with agents.","PeriodicalId":233839,"journal":{"name":"2009 10th Workshop on Image Analysis for Multimedia Interactive Services","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123776233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}