MULTIMEDIA '99 | Pub Date: 1999-10-30 | DOI: 10.1145/319463.319465
Hiroshi Kawasaki, T. Yatabe, K. Ikeuchi, M. Sakauchi
{"title":"Automatic modeling of a 3D city map from real-world video","authors":"Hiroshi Kawasaki, T. Yatabe, K. Ikeuchi, M. Sakauchi","doi":"10.1145/319463.319465","DOIUrl":"https://doi.org/10.1145/319463.319465","url":null,"abstract":"Mixed reality (MR) systems which integrate the virtual world and the real world have become a major topic in the research area of multimedia. As a practical application of these MR systems, we propose an efficient method for making a 3D map from real-world video data. The proposed method is an automatic organization method focusing on video objects to describe video data in an efficient way, i.e., by collating the real-world video data with map information using DP matching. To demonstrate the reliability of this method, we describe successful experiments that we performed using 3D information obtained from the real-world video data.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128736265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-30 | DOI: 10.1145/319463.319615
N. Sebe, M. Lew
{"title":"Robust color indexing","authors":"N. Sebe, M. Lew","doi":"10.1145/319463.319615","DOIUrl":"https://doi.org/10.1145/319463.319615","url":null,"abstract":"In content based image retrieval, color indexing is one of the most prevalent retrieval methods. In literature, most of the attention has been focussed on the color model with little or no consideration of the noise models. In this paper we investigate the problem of color indexing from a maximum likelihood perspective. We take into account the color model, the noise distribution, and the quantization of the color features. Furthermore, from the real noise distribution we derive a distortion measure, which consistently provides improved accuracy. Our investigation concludes with results on a real stock photography database, consisting of 11,000 color images.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127181665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319945
L. Khan
{"title":"Structuring and querying personalized audio using ontologies","authors":"L. Khan","doi":"10.1145/319878.319945","DOIUrl":"https://doi.org/10.1145/319878.319945","url":null,"abstract":"User-customized information selection and delivery reduces the complexity of the overwhelming amount of information available to end-users. Our approach employs user profiles, data selection, and presentation facilities to deliver customized audio information to end-users. Specifically, we construct a domain-dependent ontology (a collection of key concepts and their inter-relationships) to enable user-profile construction to support the retrieval of personalized audio information. In this research, we show how a domain-dependent ontology facilitates the generation of metadata. We demonstrate that ontology provides end-users richer forms of information to query into the system rather than keyword search. We present how this ontology is used to generate information selection requests (database queries in SQL). We develop an efficient algorithm for conjunctive queries in client-server architecture. Finally, we discuss novel optimization techniques that improve query processing performance, utilizing the knowledge associated with the ontology. The approach we have developed is being implemented in the context of the Personal AudioCast project at the USC, Integrated Media Systems Center.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115652576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319913
Sang Gil Kim, Sang Taek Kim, Young Sun Kim
{"title":"A work area adjustment method of a shared document in a multipoint multimedia conference system","authors":"Sang Gil Kim, Sang Taek Kim, Young Sun Kim","doi":"10.1145/319878.319913","DOIUrl":"https://doi.org/10.1145/319878.319913","url":null,"abstract":"Many companies are actively participating in the standardization activities of multimedia conference over packet based network such as the standardization of recommendation H.323 and T.120 series. Development of some standards of multimedia conference is completed. Multipoint document sharing method in the video conference is standardized as an ITU-T recommendation T.128 in 1998. This standard is needed to add some capability item in order to make all terminal display the shared document in the local screen. In this paper, adjusting procedures of shared document are proposed for the participant to see all part of the document. An implementation example of the procedure is shown. A multipoint multimedia conference terminal (Multiwork PC Phone) based on ISDN and packet network has the function of document size adjustment. Multiwork PC Phone automates the document sharing procedure and dynamically adjusts the size of a shared document.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122654449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319938
L. Rutledge, L. Hardman, D. Bulterman
{"title":"GRiNS: a graphical interface for SMIL","authors":"L. Rutledge, L. Hardman, D. Bulterman","doi":"10.1145/319878.319938","DOIUrl":"https://doi.org/10.1145/319878.319938","url":null,"abstract":"SMIL (Synchronized Multimedia Integration Language, pronounced “smile”) is the W3C recommendation for interactive synchronized multimedia distributed on the Web. It is an easy-to-author XML-compliant format that is similar in syntax to HTML. SMIL has been receiving a lot of attention within the Web community since its release in June 1998. It is supported by RealNetworks’ G2 system and several other commercial players. With its W3C backing, HTML-like syntax, its incorporation into the XML suite of formats and its early implementation and commercialization efforts, SMIL promises to be a widely-used solution for bringing true multimedia to the Web.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"118 Suppl 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121997256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319903
David A. Turner, K. Ross
{"title":"Optimal streaming of synchronized multimedia presentations","authors":"David A. Turner, K. Ross","doi":"10.1145/319878.319903","DOIUrl":"https://doi.org/10.1145/319878.319903","url":null,"abstract":"A synchronized multimedia presentation consists of a collection of objects, with each object having one or more rendering intervals within the presentation timeline. These intervals specify the objects' start times and end times relative to the presentation timeline. In this paper we consider the problem of streaming a multimedia presentation from a server to a client over a bandwidth-limited communication network. We suppose that each of the static objects is layered-encoded. For a given maximum delay, we consider the problem of finding the optimal number of layers in each object in order to maximize a measure of the overall quality of the presentation. We devise efficient algorithms for determining an optimal policy for several natural criteria. We also consider the problem of gradual rendering of objects after their start times. We then apply the algorithms to a randomly generated presentation containing layer-encoded JPEG images.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117092270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319884
Kim Ki Hong, Kim Poong Min, Kim Hyun Bin
{"title":"The natural sound field effect for multimedia contents","authors":"Kim Ki Hong, Kim Poong Min, Kim Hyun Bin","doi":"10.1145/319878.319884","DOIUrl":"https://doi.org/10.1145/319878.319884","url":null,"abstract":"The method of modeling the sound field informing the acoustical features of the virtual spaces is described. The sound field effect can be implemented through the appropriate combination of the early and late reflections resulting from the parameters users set up. This paper shows the method to model the various sound fields to user’s taste by computer simulation and the discussion of the results.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124065867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319910
S. Dagtas, A. Ghafoor
{"title":"Indexing and retrieval of video based on spatial relation sequences","authors":"S. Dagtas, A. Ghafoor","doi":"10.1145/319878.319910","DOIUrl":"https://doi.org/10.1145/319878.319910","url":null,"abstract":"An important aspect of video data is its spatio-temporal semantics. The relative positions and movements of the “interesting” objects in a video segment constitute a crucial component of the information to be retrieved in a multimedia database system. This is due to the dependency of the complex content descriptions on the spatio-temporal features. This paper addresses this challenge and provides an efficient technique for video data retrieval using descriptions of relative object movements. Such descriptions are used for content-based retrieval processing based on a graphical representation, named Coordinate Valued Neighborhood Graph.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131616781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319926
Ryotaro Suzuki, Y. Iwadate, M. Minoh
{"title":"Image wave: a study on image synchronization","authors":"Ryotaro Suzuki, Y. Iwadate, M. Minoh","doi":"10.1145/319878.319926","DOIUrl":"https://doi.org/10.1145/319878.319926","url":null,"abstract":"“Image Wave” is a new study paradigm based on the hypothesis which is summarized as follows. 1) Images like movies that exist in time have their own rhythms, and those rhythms are formed as synthesized waves, each of which has its own frequency, phase and fluctuation. 2) Images in the mind themselves are synthesized waves. According to this hypothesis, our study searches for a new way of multimedia component synchronization based on the internal rhythm information of the components.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129388293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 | Pub Date: 1999-10-01 | DOI: 10.1145/319878.319937
D. Ponceleón, A. Amir, S. Srinivasan, T. Syeda-Mahmood, D. Petkovic
{"title":"CueVideo: automated multimedia indexing and retrieval","authors":"D. Ponceleón, A. Amir, S. Srinivasan, T. Syeda-Mahmood, D. Petkovic","doi":"10.1145/319878.319937","DOIUrl":"https://doi.org/10.1145/319878.319937","url":null,"abstract":"We demonstrate CueVideo: a system for automated indexing and retrieval of multimedia. The system consists of the following components: video analysis and segmentation, visualization and summarization techniques, spoken document retrieval and cross-modal indexing of audio/video, related slides and text.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131044852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}