11th International Multimedia Modelling Conference最新文献_第3页

Multilevel Quadratic Variation Minimization for 3D Face Modeling and Virtual View Synthesis 三维人脸建模与虚拟视图合成的多级二次变差最小化

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.55

Xiaozheng Zhang, Yongsheng Gao, M. Leung

引用次数: 12

Framework for Script Based Virtual Directing and Multimedia Authoring in Live Video Streaming 实时视频流中基于脚本的虚拟导演和多媒体创作框架

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.41

R. Xu, Jesse S. Jin, J. G. Allen

{"title":"Framework for Script Based Virtual Directing and Multimedia Authoring in Live Video Streaming","authors":"R. Xu, Jesse S. Jin, J. G. Allen","doi":"10.1109/MMMC.2005.41","DOIUrl":"https://doi.org/10.1109/MMMC.2005.41","url":null,"abstract":"We propose a novel framework that facilitates automatic editing and authoring of multimedia using static and moving cameras in a dynamic scene. The framework incorporates several video techniques such as object tracking using mean shift and object recognition using Scaled Invariant Feature Transform (SIFT). These techniques are linked together by a comprehensive yet simple-to-program script authoring mechanism based on video event detection. These combined features empower the system to play a virtual director role in live video stream editing and multimedia integration. The system requires minimum human intervention and can leverage production efficiency for both novice and professional users. The experimental results from our prototype system demonstrate that this framework is achievable using inexpensive hardware and standard video cameras. Our system provides comprehensive pre-production authoring capabilities that lend towards integration of video and heterogonous multimedia elements in realtime. We have found this framework to be useful in many applications such as live video streaming, distance education, live entertainment, sports coverage and personal video broadcasting.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131520148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

COSMOS-7: A Video Content Modeling Framework for MPEG-7 COSMOS-7:面向MPEG-7的视频内容建模框架

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.31

Athanasios C. Gkoritsas, M. Angelides

引用次数: 11

Effective Feature Extraction for Play Detection in American Football Video 橄榄球视频中有效的特征提取方法

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.37

Tie-Yan Liu, Wei-Ying Ma, HongJiang Zhang

引用次数: 26

Music Key Detection for Musical Audio 音乐关键检测的音乐音频

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.56

Yongwei Zhu, M. Kankanhalli, Sheng Gao

引用次数: 54

Generative and Discriminative Modeling toward Semantic Context Detection in Audio Tracks 基于生成和判别建模的音轨语义上下文检测

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.42

W. Chu, Wen-Huang Cheng, Ja-Ling Wu

{"title":"Generative and Discriminative Modeling toward Semantic Context Detection in Audio Tracks","authors":"W. Chu, Wen-Huang Cheng, Ja-Ling Wu","doi":"10.1109/MMMC.2005.42","DOIUrl":"https://doi.org/10.1109/MMMC.2005.42","url":null,"abstract":"Semantic-level content analysis is a crucial issue to achieve efficient content retrieval and management. We propose a hierarchical approach that models the statistical characteristics of several audio events over a time series to accomplish semantic context detection. Two stages, including audio event and semantic context modeling/testing, are devised to bridge the semantic gap between physical audio features and semantic concepts. For action movies we focused in this work, hidden Markov models (HMMs) are used to model four representative audio events, i.e. gunshot, explosion, car-braking, and engine sounds. At the semantic context level, generative (ergodic hidden Markov model) and discriminative (support vector machine, SVM) approaches are investigated to fuse the characteristics and correlations among various audio events, which provide cues for detecting gunplay and car-chasing scenes. The experimental results demonstrate the effectiveness of the proposed approaches and draw a sketch for semantic indexing and retrieval. Moreover, the differences between two fusion schemes are discussed to be the reference for future research.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130235117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Feature Relevance Learning in Content-Based Image Retrieval Using GRA 基于GRA的基于内容的图像检索中的特征相关学习

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.40

Kui Cao

{"title":"Feature Relevance Learning in Content-Based Image Retrieval Using GRA","authors":"Kui Cao","doi":"10.1109/MMMC.2005.40","DOIUrl":"https://doi.org/10.1109/MMMC.2005.40","url":null,"abstract":"In the uncertain and incomplete system study, the Grey Relational Analysis(GRA) method in grey system theory throws emphasis on the problem of \"small-sized data samples, poor information and uncertainty\" which cannot be handled by traditional statistics. As user’s query requirement may be ambiguous and subjective sometimes in content-based image retrieval, the query results are uncertain to some extent; therefore, retrieval process can be treated as a grey system, and the query vectors and the weight values of image features as the grey numbers. So, it is a good approach for us to develop a relevance feedback technique for content-based image retrieval using the GRA method in grey system theory. In this paper, we propose a novel relevance feedback technique for content-based image retrieval using the GRA method in the grey system theory. The key idea of the proposed approach is the grey relational analysis of the feature distributions of images the user has judged relevant, in order to understand what features have been taken into account (and to what extent) by the user in formulating this judgment, so that we can accentuate the influence of these features in the overall evaluation of image similarity. The proposed method, which allows the user to retrieve the image database and progressively refine system’s response to the query by indicating the degree of relevance of retrieved images, dynamically updates the query vectors and the weights for similarity measure in order to accurately represent the user’s particular information needs. Experimental results show that the proposed approach captures the user’s information needs more precisely.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128815347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Retrieval of News Video Using Video Sequence Matching 基于视频序列匹配的新闻视频检索

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.63

Young-tae Kim, Tat-Seng Chua

引用次数: 58

Interactive Visual Retrieval System for Large Scale 3D Models Database 面向大型三维模型数据库的交互式可视化检索系统

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.50

Weibin Liu, Y. Uehara, Hao Yu, D. Masumoto, Yi Liu, Jiantao Pu, H. Zha

引用次数: 2

Meta Data Extraction from Linguistic Meeting Transcripts for the Annodex File Format 从语言会议记录中提取元数据的注释文件格式

11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.53

Claudia Schremmer, S. Pfeiffer

{"title":"Meta Data Extraction from Linguistic Meeting Transcripts for the Annodex File Format","authors":"Claudia Schremmer, S. Pfeiffer","doi":"10.1109/MMMC.2005.53","DOIUrl":"https://doi.org/10.1109/MMMC.2005.53","url":null,"abstract":"Semantic interpretation of the data distributed over the Internet is subject to major current research activity. The Continuous Media Web (CMWeb) extends the World Wide Web to time-continuously sampled data such as audio and video in regard to the searching, linking, and browsing functionality. The CMWeb technology is based the file format Annodex which streams the media content interspersed with markup in the Continuous Media Markup Language (CMML) format that contains information relevant to the whole media file, e.g., title, author, language as well as time-sensitive information, e.g., topics, speakers, time-sensitive hyperlinks. The CMML markup may be generated manually or automatically. This paper investigates the automatic extraction of meta data and markup information from complex linguistic annotations, which are annotated recordings collected for use in linguistic research. We are particularly interested in annotated recordings of meetings and teleconferences and see automatically generated CMML files and their corresponding Annodex streams as one way of viewing such recordings. The paper presents some experiments with generating Annodex files from hand-annotated meeting recordings.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125344696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0