{"title":"A query model for retrieving relevant intervals within a video stream","authors":"S. Pradhan, Keishi Tajima, Katsumi Tanaka","doi":"10.1109/MMCS.1999.778586","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778586","url":null,"abstract":"The nature of video is such that even an hour long video data may contain a large number of meaningful intervals. Manual identification of all such intervals is practically infeasible. There has been some success in automatically parsing and indexing video data through the integration of technologies such as image processing, speech/character recognition, and natural language understanding. However, even by applying such techniques, complete identification of all the intervals required for answering all possible queries cannot be achieved. As a result, using the current state-of-art techniques, whether automatic or manual, it is only fragmentary video intervals that can be successfully indexed. Our goal is to retrieve meaningful intervals within such fragmentarily indexed video streams. We propose a new set of algebraic operations which enable us to compose all the intervals that are conceivably relevant to a query. Since these operations may compose even irrelevant intervals, we provide a mechanism to exclude as many of them as possible from the answer set.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"355 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133848335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Key independent watermark detection","authors":"R. V. Schyndel, A. Tirkel, I. Svalbe","doi":"10.1109/MMCS.1999.779265","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779265","url":null,"abstract":"Many types of pseudo-random signals have been used to embed signatures as watermarks, with spread spectrum signal techniques used to recover the signature from the encrypted data. Legendre sequences are a suitable candidate for signature encryption as they exhibit 'perfect' two level auto-correlation. Additionally Legendre sequences have the unusual and interesting property of invariance under Fourier transformation; the spatial and frequency representation of each sequence is identical up to a phase factor. The presence of a Legendre-based watermark, embedded in the pixel or transform domain, can be detected by cross-correlating a sequence-encrypted image with its Fourier transform. This property enables verification of the presence of a watermark (of specified length), without requiring prior knowledge of the sequence type or key used for the encryption.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114796925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Spatial color indexing: a novel approach for content-based image retrieval","authors":"Y. Tao, W. Grosky","doi":"10.1109/MMCS.1999.779257","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779257","url":null,"abstract":"The paper examines the use of a computational geometry based spatial color indexing methodology for efficient and effective image retrieval. In this scheme, an image is evenly divided into a number of M*N non overlapping blocks, and each individual block is abstracted as a unique feature point labeled with its spatial location, dominant hue, and dominant saturation. For each set of feature points labeled with the same hue or saturation, we construct a Delaunay triangulation and then complete the feature point histogram by discretizing and counting the angles produced by this triangulation. The concatenation of all these feature point histograms serves as the image index. An important contribution of this work is to encode the spatial color information using geometric triangulation, which is translation, rotation, and scale independent. We have implemented the proposed approach and have tested it over two image collections of 2000 JPEG images and 1380 GIF images. Various experimental results demonstrate the efficacy of our techniques.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117293428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A DCT-domain H.263 based video combiner for multipoint continuous presence video conferencing","authors":"Da-Jin Shiu, Chia-Chiang Ho, Ja-Ling Wu","doi":"10.1109/MMCS.1999.778143","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778143","url":null,"abstract":"This paper proposes an H.263 based DCT-domain video combiner which is suitable for a multipoint continuous presence videoconference system and supports up to six conferees. The main issues of the H.263 video combiner are discussed. A software-based combiner is implemented and tested for various test sequences. The combined videos have promising quality and the combiner is considered very efficient for practical usage.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117315776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Brambilla, A. Ventura, I. Gagliardi, R. Schettini
{"title":"Multiresolution wavelet transform and supervised learning for content-based image retrieval","authors":"C. Brambilla, A. Ventura, I. Gagliardi, R. Schettini","doi":"10.1109/MMCS.1999.779144","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779144","url":null,"abstract":"We focus on the definition of an effective strategy that allows the user to pose a visual query and retrieve a set of images from a database that satisfy his criteria of pictorial similarity without requiring any semantic expression of them. The strategy exploits a multiresolution wavelet transform to effectively describe image content. The salient features of the images are coded in signatures of predefined lengths which are compared in the retrieval phase by applying a similarity measure the system has pre-learned, using a regression model for ordinal responses, from a learning set of \"very similar\", \"rather-similar\", \"not-very-similar\", and \"different\" pairs of images. Some experimental results demonstrating the effectiveness of this approach are reported.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123562518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Managing large scale broadband multimedia services on distributed media servers","authors":"R. Luling","doi":"10.1109/MMCS.1999.779224","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779224","url":null,"abstract":"The paper presents algorithms and principles for the management of large scale broadband multimedia services. For the implementation of these services, we use a network of distributed media servers, storing broadband media information e.g. audio and video that are streamed from the server to the connected clients. The problem studied in the paper is the managing of media content on a server network. We present the \"Distributed Server Management System (DSMS)\" that performs an efficient content and service management on the distributed servers. The DSMS allows one to collect access patterns from the users of the multimedia services and uses this knowledge for the solution of some combinatorial optimization algorithms that have to be solved for an efficient assignment of media assets. We present these optimization algorithms and evaluate their performance using some benchmark instances.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123921785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video corpus construction and analysis","authors":"T. Satou, Akihito Akutsu, Yoshinobu Tonomura","doi":"10.1109/MMCS.1999.778512","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778512","url":null,"abstract":"This paper proposes video corpus analysis as a new approach to video handling. The purpose of this approach is to discover frequent and characteristic video expressions from a large amount of video data. A video corpus has been built and currently consists of about 180 hours of MPEG-2 encoded video data, automatically extracted characteristics, and manually tagged attributes. These data include shot boundaries, camera operations, transition time and types between shots, text appearance in video, and thumbnail video frame images. Various tools are developed to enter, analyze, and visualize the video data and attributes. This paper mentions early results; analysis of the video corpus using N-gram statistics of the frame images, probabilities of attributes, and distribution of text appearance timing, reveals some interesting video expressions and usages that can be adopted for video handling.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124792086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Pereira, Joseph Ó Ruanaidh, F. Deguillaume, G. Csurka, T. Pun
{"title":"Template based recovery of Fourier-based watermarks using log-polar and log-log maps","authors":"S. Pereira, Joseph Ó Ruanaidh, F. Deguillaume, G. Csurka, T. Pun","doi":"10.1109/MMCS.1999.779316","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779316","url":null,"abstract":"Digital watermarks have been proposed as a method for discouraging illicit copying and distribution of copyrighted material. The paper describes a method for the secure and robust copyright protection of digital images. We present an approach for embedding a digital watermark into an image using the fast Fourier transform. To this watermark is added a template in the Fourier transform domain, to render the method robust against rotations and scaling, or aspect ratio changes. We detail an algorithm based on the log-polar or log-log maps for the accurate and efficient recovery of the template in a rotated and scaled image. We also present results which demonstrate the robustness of the method against some common image processing operations such as compression, rotation, scaling and aspect ratio changes.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128176913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluation of copyright marking systems","authors":"F. Petitcolas, Ross J. Anderson","doi":"10.1109/MMCS.1999.779264","DOIUrl":"https://doi.org/10.1109/MMCS.1999.779264","url":null,"abstract":"Hidden copyright marks have been proposed as a solution for solving the illegal copying and proof of ownership problems in the context of multimedia objects. Many systems have been proposed, but it is still difficult to have even a rough idea of their performance and hence to compare them. So we first describe some general attacks on audio and image marking systems. Then we propose a benchmark to compare image marking software on a fair basis. This benchmark is based on a set of attacks that any system ought to survive.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127118542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scheduling of adaptive multimedia documents","authors":"Stefan Wirag","doi":"10.1109/MMCS.1999.778403","DOIUrl":"https://doi.org/10.1109/MMCS.1999.778403","url":null,"abstract":"Since multimedia documents may comprise continuous media, such as audio and video, the presentation of those documents may require a significant amount of processing and network resources. Depending on the system configuration and the current system load, it can happen that there are not enough resources to render a multimedia document according to the specification, resulting in a reduced presentation quality. To cope with those situations, documents can be specified as flexible so that they can be adapted to different system configurations and load conditions. We present an adaptive scheduling algorithm which allows to adapt documents conforming to our document model Tiempo in environments with best-effort assignment of resources.","PeriodicalId":408680,"journal":{"name":"Proceedings IEEE International Conference on Multimedia Computing and Systems","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1999-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125673088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}