MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319470
A. Uitdenbogerd, J. Zobel
{"title":"Melodic matching techniques for large music databases","authors":"A. Uitdenbogerd, J. Zobel","doi":"10.1145/319463.319470","DOIUrl":"https://doi.org/10.1145/319463.319470","url":null,"abstract":"With the growth in digital representations of music, and of music stored in these representations, it is increasingly attractive to search collections of music. One mode of search is by similarity, but, for music, similarity search presents several difficulties: in particular, for melodic query support, deciding what part of the music is likely to be perceived as the theme by a listener, and deciding whether two pieces of music with different sequences of notes represent the same theme. In this paper we propose a three-stage framework for matching pieces of music. We use the framework to compare a range of techniques for determining whether two pieces of music are similar, by experimentally testing their ability to retrieve different transcriptions of the same piece of music from a large collection of MIDI files. These experiments show that different comparison techniques differ widely in their effectiveness; and that, by instantiating the framework with appropriate music manipulation and comparison techniques, pieces of music that match a query can be identified in a large collection.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131134931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319623
N. Dimitrova, R. Koenen, H. H. Yu, A. Zakhor, F. Galliano, C. Bouman
{"title":"Video portals for the next century (panel session)","authors":"N. Dimitrova, R. Koenen, H. H. Yu, A. Zakhor, F. Galliano, C. Bouman","doi":"10.1145/319463.319623","DOIUrl":"https://doi.org/10.1145/319463.319623","url":null,"abstract":"Panel organizer: Nevenka Dimitrova, Philips Research, Nevenka.Dimitrova@philips.com Panelists: Rob Koenen, KPN Research, R.H.Koenen@research.kpn.com Heather Yu, Panasonic Information & Networking Technology Lab, heathery@research.panasonic.com Avideh Zakhor, University of California at Berkeley, avz@EECS.Berkeley.EDU Francis Galliano, BBC, francis.galliano@bbc.co.uk Charles Bouman, Purdue University, bouman@ecn.purdue.edu","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126125752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319469
A. Ginsberg, R. Viswanathan
{"title":"A calculus for dynamic customization of virtual environments","authors":"A. Ginsberg, R. Viswanathan","doi":"10.1145/319463.319469","DOIUrl":"https://doi.org/10.1145/319463.319469","url":null,"abstract":"Two problems in the design and deployment of multimedia applications are the lack of design-time and run-time flexibility. In this paper we discuss a general methodology for tackling these issues. The work presented here is an extension of the AlphaOmega framework of [4]. In that framework we showed how the intuitive notion of an object representing its properties and capabilities to other objects differentially could be exploited to provide a powerful but easy way to change the behavior and interfaces of an application, dynamically if desired. In this paper, we develop a formal approach to the basic principles of the AlphaOmega framework. This leads to the definition of a formal system called the αω-calculus. The αω-calculus identifies a set of programming language abstractions that can be consistently added to any object-oriented language. While the calculus captures the intuitive notions underlying the AlphaOmega framework, it also goes beyond the original framework in power and flexibility. 
We demonstrate the generality of our approach by working with an example that shows how it provides unifying abstractions for such seemingly diverse domains as interactive distance learning and various issues in the area of multimedia documents.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129568161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319477
R. Hübscher
{"title":"Finding redundant paths in hypermedia","authors":"R. Hübscher","doi":"10.1145/319463.319477","DOIUrl":"https://doi.org/10.1145/319463.319477","url":null,"abstract":"Web page prerequisites can be used to constrain how a user can explore a web site. This can be used in a number of ways, but plays an especially important role in educational sites. If disjunctive and conjunctive constraints are used to describe the preferences, some pages may become redundant given the user's previous path and the user may not want to visit them. The redundancy of a page is not a local property and may depend on many preferences. This paper describes an efficient algorithm that finds these redundant pages and discusses some applications of the approach.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129003243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319480
John R. Smith, R. Mohan, Chung-Sheng Li
{"title":"Scalable multimedia delivery for pervasive computing","authors":"John R. Smith, R. Mohan, Chung-Sheng Li","doi":"10.1145/319463.319480","DOIUrl":"https://doi.org/10.1145/319463.319480","url":null,"abstract":"Growing numbers of pervasive devices are gaining access to the Internet and other information sources. However, much of the rich multimedia content cannot be easily handled by the client devices with limited communication, processing, storage and display capabilities. In order to improve access, we are developing a system for scalable delivery of multimedia. The system uses an InfoPyramid for managing and manipulating multimedia content composed of video, images, audio and text. The InfoPyramid manages the different variations of media objects with different fidelities and modalities and generates and selects among the alternatives in order to adapt the delivery to different client devices. We describe a system for scalable multimedia delivery for a variety of client devices, including PDAs, HHCs, smart phones, TV browsers and color PCs.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123132100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319631
Don Knox, T. Itagaki, I. Stewart, A. Nesbitt, I. Kemp
{"title":"Preservation of local sound periodicity with variable-rate video","authors":"Don Knox, T. Itagaki, I. Stewart, A. Nesbitt, I. Kemp","doi":"10.1145/319463.319631","DOIUrl":"https://doi.org/10.1145/319463.319631","url":null,"abstract":"A method of allowing pitch preservation of sound with variable-rate video playback is suggested. This is an important factor in monitoring of audio content for cueing purposes. Methods of separately considering a signal's frequency and time representations are considered with a view to performing time-scale modification with preservation of local periodicity (pitch). Particular emphasis is placed upon granulation in time of a sampled source — a technique based upon Dennis Gabor's landmark papers in 1946 and 1947 and developed in the field of computer music. This relatively simple method requires no prior signal analysis and is therefore a less computationally expensive method of achieving the goals stated above. This is an important point considering the need for real-time implementation. The process does however introduce some distortion, and investigation into how this may be minimised is necessary to produce acceptable results.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117107945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319652
G. Pingali, G. Tunali, I. Carlbom
{"title":"Audio-visual tracking for natural interactivity","authors":"G. Pingali, G. Tunali, I. Carlbom","doi":"10.1145/319463.319652","DOIUrl":"https://doi.org/10.1145/319463.319652","url":null,"abstract":"The goal in user interfaces is natural interactivity unencumbered by sensor and display technology. In this paper, we propose that a multi-modal approach using inverse modeling techniques from computer vision, speech recognition, and acoustics can result in such interfaces. In particular, we demonstrate a system for audio-visual tracking, showing that such a system is more robust, more accurate, more compact, and yields more information than using a single modality for tracking. We also demonstrate how such a system can be used to find the talker among a group of individuals, and render 3D scenes to the user.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"34 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123363394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319479
C. Greenhalgh, S. Benford, Gail Reynard
{"title":"A QoS architecture for collaborative virtual environments","authors":"C. Greenhalgh, S. Benford, Gail Reynard","doi":"10.1145/319463.319479","DOIUrl":"https://doi.org/10.1145/319463.319479","url":null,"abstract":"We present a QoS architecture for collaborative virtual environments (CVEs), focusing on the management of streamed video within shared virtual worlds. Users express QoS requirements by negotiating levels of mutual awareness using our previously defined spatial model of interaction. The architecture uses these awareness values as part of dynamic QoS management. A key aspect of the architecture is that it maintains a balance between the needs of a group of users as a whole (e.g., which streams are admitted onto a shared network) versus those of individual users within the group (e.g., which streams are subscribed to by a local host). We walk through a demonstration scenario, a virtual shopping mall, to show the architecture at work.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123457677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319468
Susanne CJ Boll, W. Klas, Jochen Wandel
{"title":"A cross-media adaptation strategy for multimedia presentations","authors":"Susanne CJ Boll, W. Klas, Jochen Wandel","doi":"10.1145/319463.319468","DOIUrl":"https://doi.org/10.1145/319463.319468","url":null,"abstract":"Adaptation techniques for multimedia presentations are mainly concerned with switching between different qualities of single media elements to reduce the data volume and thereby adapt to limited presentation resources. This kind of adaptation, however, is limited by an inherent lower bound, i.e., the lowest acceptable technical quality of the respective media type. To overcome this limitation, we propose cross-media adaptation, in which the presentation alternatives can be media elements of different media types, or even different fragments. The alternatives can therefore vary widely in media type and data volume, which greatly widens the possibilities for adapting efficiently to the current presentation resources. However, the adapted presentation must still convey the same content as the original one; hence, the substitution of media elements and fragments must preserve the presentation semantics. Therefore, our cross-media adaptation strategy provides models for the automatic augmentation of multimedia documents with semantically equivalent presentation alternatives. Additionally, during presentation, substitution models enforce a semantically correct information flow in case of dynamic adaptation to varying presentation resources. 
The cross-media adaptation strategy allows for flexible reuse of multimedia content in many different environments and, at the same time, maintains a semantically correct information flow of the presentation.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129181827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '99 Pub Date: 1999-10-30 DOI: 10.1145/319463.319482
Tae-uk Choi, Young-Ju Kim, Kidong Chung
{"title":"A prefetching scheme based on the analysis of user access patterns in news-on-demand system","authors":"Tae-uk Choi, Young-Ju Kim, Kidong Chung","doi":"10.1145/319463.319482","DOIUrl":"https://doi.org/10.1145/319463.319482","url":null,"abstract":"News-on-demand (NOD) articles differ from VOD data in media type, size, creation interval and user interactivity. Because of these intrinsic characteristics, user access patterns for NOD articles can differ from those for VOD data. In this paper, we analyze the log file of one electronic newspaper to show the short-term popularity and long-term popularity patterns. Based on these patterns, we propose the LLBF (Largest Life-cycle Based Frequency) prefetching scheme, which uses the two popularity patterns to cache a set of popular articles. In simulation, we show that the proposed LLBF prefetching scheme increases the hit ratio and reduces the number of replacements compared with other replacement algorithms when a small number of articles, such as headline news, is prefetched into main memory.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"60 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120884126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}