{"title":"Gesture based 3D man-machine interaction using a single camera","authors":"Senthil Kumar, J. Segen","doi":"10.1109/MMCS.1999.779273","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779273","url":null,"abstract":"This paper describes a new gesture based input interface system that allows users to control both 2D and 3D applications using simple hand gestures. Using a single camera attached to the computer, the system tracks the user's hand in three dimensions and computes up to four parameters in real-time (60 Hz). The system recognizes three gestures that can be interpreted as discrete commands to applications. This system is an off-shoot of an earlier system called Gesture VR that requires multiple cameras. Since the new system uses a single video source it can run readily on a standard home computer equipped with an inexpensive camera and is, therefore, accessible to most users. The system can be used with applications that require 2D and 3D interactions. Examples discussed in this paper include 3D virtual fly-throughs, graphical scene composers and video games.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123364358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards multimedia orchestra: a proposal for an interactive multimedia art creation system","authors":"Kazushi Nishimoto, K. Mase, S. Fels","doi":"10.1109/MMCS.1999.779322","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779322","url":null,"abstract":"We propose a novel multimedia orchestra system named MusiKalScope-2. From our past experiences with multimedia art creation systems, we are aware of two issues: cognitive overload due to the simultaneous creation of music and graphics by a single performer, and finding a way to provide expert knowledge to a system. To address these issues, we propose two approaches: (1) use attribute based role allotment; and (2) the direct use of expert knowledge. We describe the construction of MusiKalScope-2 and illustrate the realization of the proposed methods.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114543666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Streaming video with transformation-based error concealment and reconstruction","authors":"B. Wah, Xiao Su","doi":"10.1109/MMCS.1999.779202","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779202","url":null,"abstract":"Real-time video streaming over the Internet requires robust delivery mechanisms with low overhead. Traditional error control schemes are not attractive because they either add redundant information that may worsen network traffic, or rely solely on the inadequate capability of the decoder to do error concealment. As sophisticated concealment techniques cannot be employed in a real-time software playback scheme, we propose a simple yet efficient transformation-based error concealment algorithm. The algorithm applies a linear transformation to the original video signals, with the objective of minimizing the mean squared error if missing information were restored by simple averaging at the destination. We also describe two strategies to cope with error propagations in temporal differentially coded frames. Experimental results show that our proposed transformation-based reconstruction algorithm performs well in real Internet tests.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116363163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MDC: a software tool for developing MPEG applications","authors":"Dongge Li, I. Sethi","doi":"10.1109/MMCS.1999.779243","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779243","url":null,"abstract":"The paper presents a modularization method for the design of MPEG decoding software. Compared to the traditional MPEG decoder architecture, the architecture obtained from the modularization method can effectively improve the flexibility and reusability of MPEG decoding software without affecting the speed performance. Using this design approach, the paper presents MPEG Developing Classes (MDC), a software tool for developing MPEG video applications. The feedback from users of MDC shows that MDC is flexible in use and easy to understand. It allows users to develop their specific decoders without going into details of MPEG.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116507260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Virtual European School-VES","authors":"C. Bouras, Dimitris Fotakis, V. Kapoulas, A. Koubek, H. Mayer, H. Rehatschek","doi":"10.1109/MMCS.1999.778657","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778657","url":null,"abstract":"The Virtual European School (VES) is an ongoing European project, funded by the Educational Multimedia Task Force initiative of the European Union, with the aim to develop a comprehensive online resource of teaching material for secondary school education. The system will be fed by a group of smaller publishing houses from different European countries (Austria, Italy, Greece, Great Britain) specialising in educational material. The offer will contain multimedia material, CBT products, and also additional background materials, such as passages from schoolbooks, or Internet resources. The technical structure of the VES system will be based on Internet technologies, with interconnected VES servers in each participating region. The multimedia material will be stored in a database, with multilingual annotations for each project. There exist three user groups within the VES: publishers, teachers and pupils.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"193 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113982708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient content-based image retrieval based on color homogeneous objects segmentation and their spatial relationship characterization","authors":"Y. Chahir, Liming Chen","doi":"10.1109/MMCS.1999.778570","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778570","url":null,"abstract":"We introduce several techniques which characterize color homogeneous objects and their spatial relationships for a more precise and efficient content based image searching. We first present a region growing technique for efficient color homogeneous object segmentation, and then we extend the 2D string to express spatial signatures for an accurate description of spatial relationships of objects within an image. Several optimizations, including dominant color histogram clustering, have also been proposed to an efficient search engine implementation. The experimental results that we have drawn so far show that our content based image searching techniques give a high precision while keeping a very good recall rate.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126476884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Querying by photographs: using virtual reality for content-based image retrieval","authors":"J. Assfalg, A. Bimbo, P. Pala","doi":"10.1109/MMCS.1999.779261","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779261","url":null,"abstract":"The success of the 'query by example' paradigm for content based retrieval has lowered the interest of the research community in the development of new interaction paradigms. Nevertheless, in many contexts this can be inadequate, due to the difficulty for many users to paint significant examples and the impossibility, in many cases, of reproducing complex scenes. 3D interfaces and virtual reality based interaction paradigms offer new possibilities to overcome these limitations. In the prototype system presented, a virtual world can be interactively customized by adding objects to a predefined scene and editing object properties in terms of colours and textures. The system allows the user to navigate the 3D scene. While navigating the virtual world, the user is given the opportunity of taking some photographs used to query by content a database of images.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126520554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A live video imaging for multiple users","authors":"Y. Kameda, Hideaki Miyazaki, M. Minoh","doi":"10.1109/MMCS.1999.778607","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778607","url":null,"abstract":"Many activities are held in a certain fixed space in our society. It becomes possible to participate in such activities from remote places. Video is one of the best media for this purpose, and it is important to consider how to image such activities. We propose a new video imaging method with multiple cameras for multiple users. Dynamic situations in an activity are defined for describing an imaging rule of each user, and they are detected by processing sensor data. Concretely, the imaging rule is described by a user with camera-works linked to each dynamic situation. With this description of the imaging rules, many users can request their own favourite camera-works, while the system mediates these requests to satisfy as many users as possible. We have built a prototype system and conducted an experiment in which several users could participate in lecture.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125658608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Remote Web server of digital signal processors cards and its applications in education","authors":"F. Picó, Andrés Fuster Guilló, José Francisco Colom López","doi":"10.1109/MMCS.1999.779318","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779318","url":null,"abstract":"A system is presented for the remote execution and planning of processes on digital signal processor cards. The hardware and software designed permits the sharing of processors and remote execution on the part of the users. We can distinguish three major modules: the Host, which acts as a signal processor hardware server; the customers, who send processes (typically programs written in C) and receive the compilation and results of the execution; and lastly, the host/customer connection (the link is carried out on a logic level, and thus independently of the type of net connection: Internet, Ethernet, token...). This situation is adequate for permitting access to high cost hardware platforms on the part of multiple users, facilitating the sharing of resources in research groups and the utilization in the classroom (at accessible costs) of high technology (where, until now, only the use of low cost processors has been envisaged).","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"177 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125812968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An architecture of distributed media servers for supporting guaranteed QoS and media indexing","authors":"Feng Cao, Jeffrey D. Smith, Kenji Takahashi","doi":"10.1109/MMCS.1999.778126","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778126","url":null,"abstract":"In a distributed multimedia system, multimedia sessions may get involved with multiple media servers for the retrieval of the media data or the creation of new multimedia documents. To provide the guaranteed Quality of Service (QoS) to real-time applications such as continuous media transfers, the system resources in the media servers must be reserved to avoid contention during execution time. Media indexing is also needed to support searching the media data in such a distributed environment and to provide the necessary information about the usage of system resources for delivering the media data. Due to the overhead, a centralized approach of scheduling all the requests and searching all the media data from only one agent is not efficient, and not scalable. In this study, we propose a new architecture, dividing the media servers into multiple groups of the right size. Within each group, there is a registration agent and an index agent, which take care of the resource reservation, membership management, media indexing searching, and load balancing. We demonstrate how to provide the guaranteed QoS by scheduling the requests among the multiple groups, and show the collaborations between the registration agents and the index agents inside and outside a group. The mobile media servers can fit in this architecture by the updates of membership status from the registration agents. The new IETF drafts such as SIP, RTSP and RTP are embedded into this architecture to support the general multiparty multimedia applications for real-time media streams.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127959239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}