MULTIMEDIA '01 latest papers

A compressed domain beat detector using MP3 audio bitstreams
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500172
Ye-Kui Wang, M. Vilermo
{"title":"A compressed domain beat detector using MP3 audio bitstreams","authors":"Ye-Kui Wang, M. Vilermo","doi":"10.1145/500141.500172","DOIUrl":"https://doi.org/10.1145/500141.500172","url":null,"abstract":"This paper presents a novel beat detector that processes MPEG-1 Layer III (known as MP3) encoded audio bitstreams directly in the compressed domain. Most previous beat detection or tracking systems dealing with MIDI or PCM signals are not directly applicable to compressed audio bitstreams, such as MP3 bitstreams. We have developed the beat detector as a part of a beat-pattern based error concealment scheme for streaming music over error prone channels. Special effort was used to obtain a tailored trade-off between performance, complexity and memory consumption for this specific application. A comparison between the machine-detected results to the human annotation has shown that the proposed method correctly tracked beats in 4 out of 6 popular music test signals. The results were analyzed.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"179 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125351434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 45
Automated authoring of coherent multimedia discourse in conversation systems
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500241
Michelle X. Zhou, Shimei Pan
{"title":"Automated authoring of coherent multimedia discourse in conversation systems","authors":"Michelle X. Zhou, Shimei Pan","doi":"10.1145/500141.500241","DOIUrl":"https://doi.org/10.1145/500141.500241","url":null,"abstract":"We are building a full-fledged multimedia conversation framework called Responsive Information Architect (RIA), using a combination of AI and multimedia techniques. Here we describe RIA's capability of automated authoring of a coherent multimedia discourse, which is used by RIA to express itself when conversing with a user. Specifically, we focus on explaining three unique features of our automated authoring approach: automated authoring of multimedia interaction acts, dynamic insertion of multimedia punctuation acts, and systematic design of cross-media acts.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127013473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 7
Demonstration of improved multimedia streaming by using content-aware video scaling
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500259
Avanish Tripathi, M. Claypool
{"title":"Demonstration of improved multimedia streaming by using content-aware video scaling","authors":"Avanish Tripathi, M. Claypool","doi":"10.1145/500141.500259","DOIUrl":"https://doi.org/10.1145/500141.500259","url":null,"abstract":"","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127728879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 6
Motion based object tracking in MPEG-2 stream for perceptual region discriminating rate transcoding
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500245
J. Khan, Z. Guo, Wansik Oh
{"title":"Motion based object tracking in MPEG-2 stream for perceptual region discriminating rate transcoding","authors":"J. Khan, Z. Guo, Wansik Oh","doi":"10.1145/500141.500245","DOIUrl":"https://doi.org/10.1145/500141.500245","url":null,"abstract":"Object based bit allocation can result in significant improvement in the perceptual quality of relatively low bit-rate video. In this paper we describe a novel content aware video transcoding technique that can accept high-level description of video objects and extract them from incoming video stream and use it for perceptual encoding based extreme video downscaling.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127964043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 17
Consistency control for distributed interactive media
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500176
J. Vogel, M. Mauve
{"title":"Consistency control for distributed interactive media","authors":"J. Vogel, M. Mauve","doi":"10.1145/500141.500176","DOIUrl":"https://doi.org/10.1145/500141.500176","url":null,"abstract":"In this paper we present a generic consistency control service for distributed interactive media, i.e. media which allow a distributed group of users to interact with the medium itself. Consistency control is vital to these media since they typically require that a local copy of the medium's state be maintained by each user's application. Our service helps the applications to keep the local state copies consistent. The main characteristics of this service are as follows: a significant number of inconsistencies are prevented by using a mechanism called local lag. Inconsistencies that cannot be prevented are repaired by an improved timewarp algorithm that can be executed locally without burdening the network or the applications of other users. Exceptional situations and consistency during late-join situations are supported by a consistent state request mechanism. Moreover, the service also supports the application in detecting intention conflicts between the actions of distinct users. The major part of this functionality is based on a media model and the application level protocol for distributed interactive media (RTP/I) and can thus be reused by arbitrary RTP/I-based applications. In order to demonstrate the feasibility of our approach and to evaluate its performance we have integrated the generic consistency service into a shared whiteboard system.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128202841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 57
Java multimedia telecollaboration
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500258
J. Oliveira, F. Malric, Dongsheng Yang, S. Nourian, N. Georganas
{"title":"Java multimedia telecollaboration","authors":"J. Oliveira, F. Malric, Dongsheng Yang, S. Nourian, N. Georganas","doi":"10.1145/500141.500258","DOIUrl":"https://doi.org/10.1145/500141.500258","url":null,"abstract":"Many critics of the Java language have pointed to performance deficiencies as a barrier to acceptable support of Collaborative Multimedia Systems. This paper presents four MCRLab products, fully written in Java that support collaboration amongst a group of users, and successfully demonstrate that Java collaboration frameworks are not only possible but also can perform at acceptable levels of quality, if designed correctly.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130274805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 21
Classification of summarized videos using hidden markov models on compressed chromaticity signatures
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500217
Cheng Lu, M. S. Drew, J. Au
{"title":"Classification of summarized videos using hidden markov models on compressed chromaticity signatures","authors":"Cheng Lu, M. S. Drew, J. Au","doi":"10.1145/500141.500217","DOIUrl":"https://doi.org/10.1145/500141.500217","url":null,"abstract":"Tools for efficiently summarizing and classifying video sequences are indispensable to assist in the synthesis and analysis of digital video. In this paper, we present a method for effective classification of different types of videos that uses the output of a concise video summarization technique that forms a list of keyframes. The summarization is produced by a method recently presented, in which we generate a universal basis on which to project a video frame feature that effectively reduces any video to the same lighting conditions. Each frame is represented by a compressed chromaticity signature. A multi-stage hierarchical clustering method efficiently summarizes any video. Here, we classify TV programs using a trained hidden Markov model, using the keyframe plus temporal features generated in the summaries.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114952644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 44
Panoramic video capturing and compressed domain virtual camera control
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500191
Xinding Sun, J. Foote, Don Kimber, B. S. Manjunath
{"title":"Panoramic video capturing and compressed domain virtual camera control","authors":"Xinding Sun, J. Foote, Don Kimber, B. S. Manjunath","doi":"10.1145/500141.500191","DOIUrl":"https://doi.org/10.1145/500141.500191","url":null,"abstract":"A system for capturing panoramic video and a novel method for corresponding compressed domain virtual camera control is presented. It targets applications such as classroom lectures and video conferencing. The proposed method is based on the FlyCam panoramic video system that is designed to produce high resolution and wide-angle video sequences by stitching the video pictures from multiple stationary cameras. The panoramic video sequence is compressed into an MPEG-2 stream for delivery. The proposed method integrates Region of Interest (ROI) detection, tracking, and virtual camera control, and works on compressed domain information only. It first detects the ROI in the P (predictive coded) picture using only the macroblock type information. It then up-samples this detection result to obtain the ROI of the whole video stream. The ROI is tracked using a Kalman filter. The Kalman filter estimation results are used for virtual camera control that simulates human controlled video recording. The system has no physical camera motion and the virtual camera parameters are readily available for video indexing. The proposed system has been implemented for real time processing.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127257649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 37
LinStar texture: a fuzzy logic CBIR system for textures
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500223
Hsin-Chih Lin, Chih-Yi Chiu, Shin-Nine Yang
{"title":"LinStar texture: a fuzzy logic CBIR system for textures","authors":"Hsin-Chih Lin, Chih-Yi Chiu, Shin-Nine Yang","doi":"10.1145/500141.500223","DOIUrl":"https://doi.org/10.1145/500141.500223","url":null,"abstract":"In this study, we propose a fuzzy logic CBIR system for textures, named LinStar Texture (i.e., Linguistic Star for Textures). The proposed system consists of two major phases, including database creation and query comparison. In the database creation phase, six Tamura features are extracted to describe each texture image in the database. A term set on each Tamura feature is generated through a fuzzy clustering algorithm so that degrees of appearance for the feature can be interpreted as five linguistic terms. In the query comparison phase, a user can pose textual descriptions or visual examples to find the desired textures. Furthermore, the query can be expressed as a logic composition of linguistic terms or Tamura feature values. The final similarity is then computed by aggregating each individual similarity through min-max composition rules. Experimental results reveal the proposed system is indeed effective. The retrieved images are perceptually satisfactory. The retrieval time is very fast.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127880464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 8
The evolutionary sound synthesis method
MULTIMEDIA '01 Pub Date : 2001-10-01 DOI: 10.1145/500141.500248
J. Manzolli, A. Maia, José Fornari, F. Damiani
{"title":"The evolutionary sound synthesis method","authors":"J. Manzolli, A. Maia, José Fornari, F. Damiani","doi":"10.1145/500141.500248","DOIUrl":"https://doi.org/10.1145/500141.500248","url":null,"abstract":"A mathematical model for interactive sound synthesis based on the application of Genetic Algorithms (GA) is presented. The Evolutionary Sound Synthesis Method (ESSynth) generates sequences of waveform variants by the application of genetic operators on an initial population of waveforms. We describe how the waveforms can be treated as genetic code, the fitness evaluation methodology and how genetic operations such as crossover and mutation are used to produce generations of waveforms. Finally, we discuss the results evaluating the generated sounds.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127897043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 30