D. Saha, D. Kandlur, T. Barzilai, Zon-Yin Shae, M. Willebeek-LeMair
{"title":"A video conferencing testbed on ATM: design, implementation and optimizations","authors":"D. Saha, D. Kandlur, T. Barzilai, Zon-Yin Shae, M. Willebeek-LeMair","doi":"10.1109/MMCS.1995.484904","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484904","url":null,"abstract":"This paper describes our experiences with the design and implementation of a very high-end video conferencing testbed on an ATM network. Our system is built on an IBM RISC System/6000 equipped with prototype hardware for video and audio capture and compression, and an IBM 100 Mb/s ATM adapter. In our early experiments we used UDP/IP running over ATM Adaptation Layer 5 (AAL5) for data transfer between peers. Our initial experiences with the system indicated that the overall system performance did not match our expectations even though most of the video, audio, and network processing was performed in hardware. A thorough profiling of the system revealed that the protocol processing and data handling overheads in the end-host are responsible for the poor video/audio quality. Based on these observations, we have proposed and implemented changes to the protocol data path that can significantly improve the performance of the system. Although we discuss our solution in the context of a video conferencing application, our approach is general and can be applied to many other applications. It is particularly useful for applications that are required to handle large volumes of time-critical data, such as multimedia servers.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129744229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and evaluation of data access strategies in a high performance multimedia-on-demand server","authors":"D. Jadav, C. Srinilta, A. Choudhary, P. Berra","doi":"10.1109/MMCS.1995.484936","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484936","url":null,"abstract":"One of the key components of a multi user multimedia on demand system is the data server. Digitization of traditionally analog data such as video and audio, and the feasibility of obtaining network bandwidths above the gigabit per second range are two important advances that have made possible the realization, in the near future, of interactive distributed multimedia systems. Secondary-to-main memory I/O technology has not kept pace with advances in networking, main memory and CPU processing power. Consequently, the performance of the server has a direct bearing on the overall performance of such a system. We develop a model for the architecture of a server for such a system. Parallelism of data retrieval is achieved by striping the data across multiple disks. The performance of any server ultimately depends on the data access patterns. Two modifications of the basic retrieval algorithm are presented to exploit data access patterns in order to improve system throughput and response time. A complementary information caching optimization is discussed. Finally, we present performance results of these algorithms on the IBM SP1 and Intel Paragon parallel computers.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124272567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"User recovery of audio operations","authors":"Vahid Mashayekhi, M. Maley, J. Riedl","doi":"10.1109/MMCS.1995.484944","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484944","url":null,"abstract":"Computer interfaces that support user recovery can radically alter a user's interaction style. Users can explore alternatives freely, secure in the knowledge that they can undo actions and restore previous states if necessary. A text-editor, like EMACS, where users can restore the state of an editing session to a correct previous state, is an example of such a system. Editors for textual, graphical, and many other media types commonly support user recovery. Support for and understanding of recovery in applications that use audio is not as widespread. Audio is characterized by its large volume, lack of easy indexing, and difficulty in defining inverse operations. We present a theoretical model of recovery for audio operations to help user interface designers and implementers. Our model maps an audio operation to a recovery policy and then the recovery policy to a recovery mechanism. The model uses a classification of audio operations that aids in choosing applicable recovery policies.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"250 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123344751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Doing FLIPS: FLexible Interactive Presentation Synchronization","authors":"James A. Schnepf, J. Konstan, D. Du","doi":"10.1109/MMCS.1995.484926","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484926","url":null,"abstract":"As multimedia presentation technology advances, it is possible to incorporate a wider range of media including variable duration media such as simulations and animations. At the same time, users are able to take more control over presentations by controlling the rate and selection of media being played. To make full use of these advances, multimedia systems must support flexible presentations that incorporate many variations in the way they are played. This paper identifies three requirements for flexible presentations and derives four requirements for synchronization of flexible presentations and how they can be achieved. The paper presents FLexible Interactive Presentation Synchronization (FLIPS), a model for specifying coarse synchronization for flexible presentations. FLIPS supports a wide range of temporal synchronization specifications and provides algorithms for attaining a consistent and coherent presentation state in response to user interaction (e.g., skipping to a different slide or selection) and other state-changing events. Applications of the FLIPS model are discussed.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116641780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Networking for success in cyberspace","authors":"R. Garud, A. Kumaraswamy, A. Prabhu","doi":"10.1109/MMCS.1995.484945","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484945","url":null,"abstract":"Several key technologies are converging to create the emerging cyberspace. We characterize this convergence process as one of cumulative synthesis and suggest that the network mode of organization is the most appropriate for facilitating convergence.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114981606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. Chung, T. Shih, Jiung-yao Huang, Ying-Hong Wang, T.-F. Kuo
{"title":"An object-oriented approach and system for intelligent multimedia presentation designs","authors":"C. Chung, T. Shih, Jiung-yao Huang, Ying-Hong Wang, T.-F. Kuo","doi":"10.1109/MMCS.1995.484934","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484934","url":null,"abstract":"Many presentation or authoring tools were developed for presenters or artists in various fields. However, presentations created by these tools were either communicating with its addressees in a single direction, or providing limited navigation controls for the audiences via push buttons or menus. These presentations cannot incorporate addressees' responses. As a result, an audience watches the same demonstration over and over again even if he/she has told the computer that the topic is understood. We introduce a multimedia presentation design system that allows a presenter to plan the audience's reaction in advance. While the audience is watching a presentation, the underlying inference system is learning from their responses. This mechanism makes a presentation proceed again and act according to the audience's background and knowledge. Thus, the resulting presentation is more diversified.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126444232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Young Francis Day, S. Dagtas, Mitsutoshi Iino, A. Khokhar, A. Ghafoor
{"title":"Spatio-temporal modeling of video data for on-line object-oriented query processing","authors":"Young Francis Day, S. Dagtas, Mitsutoshi Iino, A. Khokhar, A. Ghafoor","doi":"10.1109/MMCS.1995.484913","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484913","url":null,"abstract":"The paper presents a framework for data modeling and semantic abstraction of image/video data. The framework is based on spatio-temporal information associated with salient objects in an image or in a sequence of video frames and on a set of generalized n-ary operators defined to specify spatial and temporal relationships of objects present in the data. The methodology presented in this paper can manifest itself effectively in conceptualizing events and heterogeneous views in multimedia data as perceived by individual users. The proposed paradigm induces a multilevel indexing and searching mechanism that models information at various levels of granularity and hence allows processing of content-based queries in real time. We also devise a unified object-oriented interface for users with heterogeneous views to specify queries on the unbiased encoded data. This framework is being developed to realize a highly integrated multimedia database architecture.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126765445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Presentation layer primitives for the layered multimedia data model","authors":"Gerhard A. Schloss, Michael J. Wynblatt","doi":"10.1109/MMCS.1995.484928","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484928","url":null,"abstract":"The Layered Multimedia Data Model (LMDM) adds structure to the problem of specifying multimedia compositions by dividing the task into smaller, more manageable pieces. This paper outlines the basic constructs of the third LMDM layer, the Data Presentation Layer (DPL). The DPL allows specification of multimedia presentations, which describe the audio and visual display of temporally linked data objects specified at the lower layers. The strengths of the DPL, include: reusability of presentation templates, a general model of media synchronization, acknowledging and limiting system dependencies, and generalization of a traditional animation model.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127623244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A unified approach to temporal segmentation of motion JPEG and MPEG compressed video","authors":"B. Yeo, Bede Liu","doi":"10.1109/MMCS.1995.484911","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484911","url":null,"abstract":"A common framework for rapid scene analysis for detecting scene changes in compressed motion JPEG and MPEG videos is proposed. We develop algorithms to detect both abrupt and gradual scene changes. The algorithms operate directly on the DC sequence which can be easily extracted from motion JPEG and MPEG compressed video without decompression. The DC images capture most of the essential \"global\" information, but is of a small fraction of the original data size. Operating on these images offers significant computation savings. Experimental results show that the proposed algorithms are fast and effective in detecting abrupt scene changes and gradual transitions.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134418391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Shivakumar, C. Sreenan, B. Narendran, P. Agrawal
{"title":"The Concord algorithm for synchronization of networked multimedia streams","authors":"N. Shivakumar, C. Sreenan, B. Narendran, P. Agrawal","doi":"10.1109/MMCS.1995.484905","DOIUrl":"https://doi.org/10.1109/MMCS.1995.484905","url":null,"abstract":"Synchronizing different data streams from multiple sources simultaneously at a receiver is one of the basic problems involved in multimedia distributed systems. This requirement stems from the nature of packet based networks which can introduce end-to-end delays that vary both within and across streams. We present a new algorithm called Concord, which provides an integrated solution for these single and multiple stream synchronization problems. It is notable because it defines a single framework to deal with both problems, and operates under the influence of parameters which can be supplied by the application involved. In particular these parameters are used to allow a trade-off between the packet loss rates, total end-to-end delay and skew for each of the streams. For applications like conferencing this is used to reduce delay by determining the minimum buffer delay/size required.","PeriodicalId":423754,"journal":{"name":"Proceedings of the International Conference on Multimedia Computing and Systems","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121470534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}