MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357759
P. Kauff, Klaas Schüür
{"title":"Fast motion estimation for real-time shape-adaptive MPEG-4 encoding","authors":"P. Kauff, Klaas Schüür","doi":"10.1145/357744.357759","DOIUrl":"https://doi.org/10.1145/357744.357759","url":null,"abstract":"This paper presents a fast motion estimator which can be used for real-time MPEG-4 encoding of arbitrarily shaped video objects. The approach is based on an existing algorithm which has already been applied successfully to format conversion. To exploit it for shape-adaptive coding, the algorithm has been adapted to the special properties of the MPEG-4 standard. With this new tool it becomes possible to encode arbitrarily shaped video objects (CIF, 25 Hz) in real-time with a MPEG-4 software encoder at a Pentium III 500 MHz. The real-time capability has to be paid by a slight loss of coding efficiency (about 0.2 dB in terms of rate-distortion measurements), compared to the MPEG-4 verification model.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133989493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357953
R. Hamada, I. Ide, S. Sakai
{"title":"Associating cooking video with related textbook","authors":"R. Hamada, I. Ide, S. Sakai","doi":"10.1145/357744.357953","DOIUrl":"https://doi.org/10.1145/357744.357953","url":null,"abstract":"We have been handling video with supplementary documents, such as cooking programs, and are working on integration of such media. Through the integration, many applications will become possible, for example, reconstruction of multimedia data that supplement the information of each medium, construction of interactive database, or kitchen automation. Until now, we have proposed an integration system that performs integrative analysis of image, audio and text and associates them with each other. In this paper, we will introduce the latest text analysis result and discuss future image and audio analysis of the proposed system.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125454556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357756
E. Tzafestas
{"title":"Integrating drawing tools with behavioral modeling in digital painting","authors":"E. Tzafestas","doi":"10.1145/357744.357756","DOIUrl":"https://doi.org/10.1145/357744.357756","url":null,"abstract":"Our goal is to integrate traditional artistic media that demand hand dexterity (such as drawing) with intelligent systems techniques that may both constrain and nourish this dexterity. To this end we are experimenting with special brush tools that on top of traditional drawing provide possibilities that involve information processing and behavioral modeling. In this work, we are introducing a behavioral model as a color processing feature of our brush. More precisely, we use a regulation mechanism, that has been shown elsewhere to solve a typical problem in artificial ant societies, to distribute color on the drawing canvas. Our drawing tool, called “AntBrush”, manages to create controlled color variety during drawing by regulating its own quantity of color through picking and depositing color on the canvas. Different color effects are possible by controlling the brush's parameters on line.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"177 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115285725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357747
Francesca Barrientos
{"title":"Continuous control of avatar gesture","authors":"Francesca Barrientos","doi":"10.1145/357744.357747","DOIUrl":"https://doi.org/10.1145/357744.357747","url":null,"abstract":"We are developing an application to give humans the ability to transmit nonverbal communication behaviors through an avatar: specifically gesture, the movements of the arms and hands that accompany speech when people speak face-to-face. In this application the user will have continuous control over the avatar animation. The avatar will be like a virtual puppet and the user will manipulate the avatar using not strings or rods but the controlled and skilled motions of their hand. The system tracks hand motions and then maps that motion to the joint motions of a three-dimensional articulated avatar. As part of this research we will try out different ways of tracking the user's hand. Eventually we plan to test the efficacy of this system by incorporating it into a networked virtual environment in which two or more people can interact through the virtual medium. Working with artists will enable us to design a system that is expressive and to better understand the expressive power of this system.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124428542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357889
A. Benitez, D. Bulterman, B. Horowitz, A. Eleftheriadis, G. Vaithilingam
{"title":"Standards, interoperability and practice: who needs standards anyway? (Panel Session)","authors":"A. Benitez, D. Bulterman, B. Horowitz, A. Eleftheriadis, G. Vaithilingam","doi":"10.1145/357744.357889","DOIUrl":"https://doi.org/10.1145/357744.357889","url":null,"abstract":"","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126474003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357933
I. Ide, R. Hamada, S. Sakai, Hidehiko Tanaka
{"title":"Scene identification in news video by character region segmentation","authors":"I. Ide, R. Hamada, S. Sakai, Hidehiko Tanaka","doi":"10.1145/357744.357933","DOIUrl":"https://doi.org/10.1145/357744.357933","url":null,"abstract":"Reflecting the demand for recycling and retrieval of video, we are proposing an automatic indexing system for news video that considers correspondences between textual indices and image contents. In this paper, we focus on the background image content (i.e. scene) identification portion of the system. The analysis is performed by segmenting (human) character region from background region, and was applied to actual news video for evaluation. The overall result showed the effectiveness of the proposed method by 7 to 8%, and indicated that character existence itself is an important feature. Individual observation among various scenes indicated that multiple features should be combinatorily used according to each scene, and that the data set should be exponentially extended for higher performance.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130860520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357898
R. Chandramouli, N. Memon
{"title":"A distributed detection framework for Steganalysis","authors":"R. Chandramouli, N. Memon","doi":"10.1145/357744.357898","DOIUrl":"https://doi.org/10.1145/357744.357898","url":null,"abstract":"Many watermarking algorithms have been proposed and studied for their robustness and other properties. But, there has been little effort in analyzing these algorithms for their vulnerability against detection by an adversary. As a general philosophy for robust watermarking the host signal and the watermarked signal are well separated in a statistical distance sense. This very nature can be exploited by an adversary to easily detect the watermark and perhaps remove it. We argue that the ability of a watermark to avoid detection by an adversary is a key factor that needs more attention. In this paper, we propose a framework, based on a distributed detection technique, that can be used by the adversary to study signals for the presence/absence of watermarks. We choose a particular spatial domain image watermarking algorithm and explain how the proposed framework can be applied to detect the watermark with very little knowledge about the watermark insertion procedure. The false alarm probability can be optimally traded-off with the probability of detection using the receiver operating characteristic function.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115549469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357916
V. Talwar, K. Nahrstedt
{"title":"Securing RSVP for multimedia applications","authors":"V. Talwar, K. Nahrstedt","doi":"10.1145/357744.357916","DOIUrl":"https://doi.org/10.1145/357744.357916","url":null,"abstract":"Distributed multimedia applications require end-to-end quality of service (QoS) in order to be accepted and used. One approach to achieve end-to-end QoS is to provide end-to-end resource reservations. Resource ReSerVation Protocol (RSVP) [5] [1] is a unicast and multicast signalling protocol for setting up network bandwidth reservation. In this paper, we propose a solution for securing RSVP messages in a flexible, efficient and scalable manner. Our solution extends the RSVP protocol with a scalable QoS protection, using a hybrid hierarchical security approach. The RSVP messages go through two different protocol treatments - one within subnetworks and the other across subnetworks. We use delayed integrity checking within the subnetwork by sending feedback messages from the egress node. A stronger integrity and encryption check is made on messages sent across subnetworks. Our solution is thus an intermediate approach between the extremes of hop by hop authentication [2] and the SDS/CD protocol [8] and overcomes the drawbacks of the two protocols.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114653235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357755
L. Tarabella, G. Bertini
{"title":"Giving expression to multimedia performance","authors":"L. Tarabella, G. Bertini","doi":"10.1145/357744.357755","DOIUrl":"https://doi.org/10.1145/357744.357755","url":null,"abstract":"In this paper we describe the experience of researchers and artists involved in the activities of the “computer ART lab” (cART lab) of the Italian National Council of Research (C.N.R.) in Pisa, regarding the “wireless technology” developed for controlling in real-time and giving expression to interactive multimedia performances. Due to the daily increase in computers power and electronics systems able to sense the presence, the shape, the distance and the position of objects, a new field of investigation and implementation has been started in the last few years: computer recognition of human gesture [1][2]. As a result, the human body itself can now be considered as a natural and powerful expressive “interface” to give feeling to performances based on computer generated electro-acoustic music and computer generated visual-art. Modern human computer interfaces are extremely rich, incorporating traditional interface devices such as keyboard and mouse and a wealth of advanced media types: sound, video, animated graphics. The term multi-modal is often associated with such interfaces to emphasize that the combined use of multiple modes of perception is relevant to the user's interface [3][4]. The most relevant devices and systems developed at cART lab for gesture recognition to be used in interactive multimedia performances are here reported.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131723243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MULTIMEDIA '00, Pub Date: 2000-11-04, DOI: 10.1145/357744.357752
Teri Rueb
{"title":"Gathering crowds: mapping the post-human social body","authors":"Teri Rueb","doi":"10.1145/357744.357752","DOIUrl":"https://doi.org/10.1145/357744.357752","url":null,"abstract":"As a digital artist, I seek to create experiential works that engage participants in an infinite feedback loop of interaction and discovery. My goal is to create works that exist in a perpetual state of transformation and becoming. I am interested in the rhythm and music of everyday activities such as walking and driving—compositions that have no beginning, ending, inside or outside. Such movements, when captured and visualized, provide a portrait of the social body that reflects our understanding of Self and Other.","PeriodicalId":234597,"journal":{"name":"MULTIMEDIA '00","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131759954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}