{"title":"Client based synchronization control of coded data streams","authors":"Mourad Daami, N. Georganas","doi":"10.1109/MMCS.1997.609643","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609643","url":null,"abstract":"In a multimedia news on demand application, the delivery of multiple streams of data over a network will introduce inevitable delays and delay variations that disrupt both inter- and intra-media synchronization. Therefore, a complete stream synchronization protocol must be specified and tested to ensure proper rendering of the multimedia presentation at the client. In such a protocol, special treatment should be considered for time dependent coded data streams due to the presence of a hardware/software decoding entity at the client. We propose a client based control scheme using buffer occupancy as its main parameter. The control scheme uses time contraction and expansion concepts to: avoid data overflow or underflow conditions; and enforce intra-media synchrony of the data stream. We refer to such a control mechanism as the predecoder synchronization controller. We consider both motion JPEG and MPEG video bit streams and discuss the implementation and architectural constraints encountered.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124915385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transform-based indexing of audio data for multimedia databases","authors":"S. Subramanya, R. Simha, B. Narahari, Abdou Youssef","doi":"10.1109/MMCS.1997.609595","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609595","url":null,"abstract":"Since the relative proportion of multimedia (video, image and audio) data within databases is expected to increase substantially in the future, keyword-based indexing would be inadequate and efficient content-based query and retrieval are required. The problem of devising content based query, indexing, and retrieval for these newer data types remains an open and challenging problem. While considerable attention has recently been given to image (and, to some extent, video) indexing, much less has been devoted to the problem of indexing its unidimensional counterpart-audio data. The paper proposes content-based indexing schemes for audio data in multimedia databases. The methods are based on transform techniques used in signal processing which transform data from time (or spatial) domain to frequency domain. This offers many advantages such as easy removal of noise, efficient compression and different types of processing. Two algorithms for indexing are presented along with experimental results.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125124366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Interactive scheduling supports for real-time multimedia execution","authors":"T. Tan, W. Hsu","doi":"10.1109/MMCS.1997.609774","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609774","url":null,"abstract":"With the widespread use of multimedia in industry, there is a need to re-study the underlying computer system structure to better support the execution of multimedia applications. Traditional schedulers found in general-purpose operating systems (GPOS) like UNIX cannot give sufficient and timely service to the multimedia applications. Though there are many hard real-time systems available, they emphasize predictability and absolute guarantee of schedules. These should not be the primary concerns for multimedia applications where missing a deadline is not fatal. To address the special needs of multimedia scheduling, the authors have designed and implemented a flexible scheduling testbed, called ICute (Interactive exeCUTE). The objectives of ICute are: (1) to implement a flexible scheduling mechanism that is able to regulate the execution of multimedia applications to achieve optimal output under various load conditions, (2) to present the programmer with a suitable environment for developing and executing multimedia applications by abstracting all the internal process management and scheduling from the programmer, and (3) to improve portability since ICute is implemented on a UNIX platform, the codes are portable across many systems.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127939820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Synchronization of multimedia streams in distributed environments","authors":"E. Stoica, H. Abdel-Wahab, K. Maly","doi":"10.1109/MMCS.1997.609646","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609646","url":null,"abstract":"A major feature that distinguishes multimedia applications from other traditional applications is the integration of various media streams that have to be presented in a synchronized fashion. We present a set of algorithms for achieving fine-grain synchronization in a best-effort distributed system. Our algorithms, based on numerical time stamps, take into account both the possible frame loss due to the device buffer overflow at the sender, and the display time of a video frame at the receiver. As a proof of concept, we have implemented a robust solution for lip-synchronization. Experimental results show that the skew between audio and video was between the acceptable values. We extended the solution to n continuous stream synchronization that has the same or different sources. A direct application of our research is the Interactive Remote Instruction (IRI) project, which requires synchronization algorithms in a distance learning environment where participants use best-effort operating systems and networks.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128727802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scheduling for interactive operations in parallel video servers","authors":"Minyou Wu, W. Shu","doi":"10.1109/MMCS.1997.609591","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609591","url":null,"abstract":"Providing efficient support for interactive operations such as fast-forward and fast-backward is essential in video-on-demand and other multimedia server systems. The authors present two basic approaches for scheduling interactive operations, the prefetching approach and the grouping approach. Scheduling algorithms are presented for both fine-grain and coarse-grain data blocks. These algorithms can precisely schedule video streams for both normal playout and interactive operations.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122098980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Supporting continuous media: is Serial Storage Architecture (SSA) better than SCSI?","authors":"Sangyup Shim, Tai-Sheng Chang, Yuewei Wang, J. Hsieh, D. Du","doi":"10.1109/MMCS.1997.609795","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609795","url":null,"abstract":"The existing SCSI (Small Computer System Interface) parallel bus has been widely used in supporting multimedia applications. However, it may not fully utilize the aggregate throughput from disks in supporting continuous media because of its unfair bus accesses. The emerging serial interface, Serial Storage Architecture (SSA), provides high data bandwidth, fair accesses and fault tolerance. The fairness algorithm in SSA ensures that a fraction of the data bandwidth is allocated to each disk. In this paper, we study the performance of SSA and SCSI in supporting continuous media.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122528052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A real-time video-image mapping using polygon rendering techniques","authors":"T. Ikedo","doi":"10.1109/MMCS.1997.609585","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609585","url":null,"abstract":"This work proposes a new hardware architecture for a video-mapping processor, a key technology for generation of realistic visual images in the multimedia age. The processor combines a polygon renderer, reverse-projection processor, video-pattern cache, pattern scaler, shading processor and pixel cache. Real-time mapping of refreshed video images into animated computer graphic images is performed in parallel with a 3.8 ns max/mapped pixel. The system is implemented in the Truga001 single-chip graphics processor of 940,000 gates in 0.3μ CMOS, developed at the University of Aizu. This paper describes the mechanism of video mapping, architecture and performance evaluation.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123469671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Traffic control mechanism to support video multicast over IP networks","authors":"R. Ramanujan, Atiq Ahamad, K. Thurber","doi":"10.1109/MMCS.1997.609576","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609576","url":null,"abstract":"A major impediment to the development of networked video applications is the lack of adequate support for video multicast within the Internet Protocol (IP). This paper presents an approach for extending the capability of IP to support multicast of video streams in a heterogeneous networking environment where there are static variations in the capacities of the links and the video quality needs of the receivers as well as dynamic variations in the available bandwidth of the network links. The approach, called IP extensions for video multicast (IPEVM), uses an innovative traffic control mechanism within the network routers that dynamically adapts the quality of the video to changes in the available network bandwidth while ensuring fair treatment of all traffic contending for the network's best effort service. It also permits an application-selectable mix of resource reservation and adaptive bandwidth management to be used in building networked video applications.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126436476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech recognition on MPEG/Audio encoded files","authors":"L. Yapp, G. Zick","doi":"10.1109/MMCS.1997.609787","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609787","url":null,"abstract":"A technique to perform speech recognition directly from audio files encoded using the MPEG/Audio coding standard is described. The technique works in the compressed domain and does not require the MPEG/Audio file to be decompressed. Only the encoded subband samples are extracted and processed for training and recognition. The underlying speech recognition engine used is based on the hidden Markov model. The technique is applicable to layers I and II of MPEG/Audio, and training under one layer can be used to recognize the other. Results based on the recognition of speaker-dependent, small-vocabulary, continuously spoken sentences show accuracy as high as 99% using this technique.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"340 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133929575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic QoS management for real-time communication in ATM networks","authors":"M. Woo, H. Lim, Hong-Ling Low, C. Chee","doi":"10.1109/MMCS.1997.609804","DOIUrl":"https://doi.org/10.1109/MMCS.1997.609804","url":null,"abstract":"We propose an ATM-based solution to efficient real-time network data transfer, by mapping one dynamic channel to one or more static channels at an end-host, each of which can be individually closed to decrease the overall bandwidth. New static channels can be added to increase the overall bandwidth. Simulations of this technique have shown that reduction of bandwidth wastage becomes more evident as the range of bandwidth specified by the user increases.","PeriodicalId":302885,"journal":{"name":"Proceedings of IEEE International Conference on Multimedia Computing and Systems","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131827629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}