11th International Multimedia Modelling Conference最新文献

筛选
英文 中文
Audio Indexing for Efficient Music Information Retrieval 高效音乐信息检索的音频索引
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.22
Ioannis Karydis, A. Nanopoulos, A. Papadopoulos, Y. Manolopoulos
{"title":"Audio Indexing for Efficient Music Information Retrieval","authors":"Ioannis Karydis, A. Nanopoulos, A. Papadopoulos, Y. Manolopoulos","doi":"10.1109/MMMC.2005.22","DOIUrl":"https://doi.org/10.1109/MMMC.2005.22","url":null,"abstract":"This paper presents an algorithm that efficiently retrieves audio data similar to an audio query. The proposed method utilises a feature extraction method for acoustical music sequences. The extracted features are grouped by Minimum Bounding Rectangles (MBRs) and indexed by means of a spatial access method. We also present a novel false alarm resolution method that utilises a reverse order schema while calculating the distance of the query and results, in order to avoid costly operations. Performance evaluation results show that the proposed technique achieves considerable performance improvement in comparison to an existing method.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123410805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Semantic Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns 基于相互增强原理和镜头排列模式的语义视频摘要
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.64
Shi Lu, Michael R. Lyu, Irwin King
{"title":"Semantic Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns","authors":"Shi Lu, Michael R. Lyu, Irwin King","doi":"10.1109/MMMC.2005.64","DOIUrl":"https://doi.org/10.1109/MMMC.2005.64","url":null,"abstract":"We propose a novel semantic video summarization framework, which generates video skimmings that guarantee both the balanced content coverage and the visual coherence. First, we collect video semantic information with a semi-automatic video annotation tool. Secondly, we analyze the video structure and determine each video scene’s target skim length. Then, mutual reinforcement principle is used to compute the relative importance value and cluster the video shots according to their semantic descriptions. Finally, we analyze the arrangement pattern of the video shots, and the key shot arrangement patterns are extracted to form the final video skimming, where the video shot importance value is used as guidance. Experiments are conducted to evaluate the effectiveness of our proposed approach.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"682 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116109449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Realistic 3D Face Modeling by Fusing Multiple 2D Images 通过融合多个2D图像实现逼真的3D人脸建模
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.73
Changhu Wang, Shuicheng Yan, HongJiang Zhang, Wei-Ying Ma
{"title":"Realistic 3D Face Modeling by Fusing Multiple 2D Images","authors":"Changhu Wang, Shuicheng Yan, HongJiang Zhang, Wei-Ying Ma","doi":"10.1109/MMMC.2005.73","DOIUrl":"https://doi.org/10.1109/MMMC.2005.73","url":null,"abstract":"In this paper, we propose a fully automatic and efficient algorithm for realistic 3D face reconstruction by fusing multiple 2D face images. Firstly, an efficient multi-view 2D face alignment algorithm is utilized to localize the facial points of the face images; and then the intrinsic shape and texture models are inferred by the proposed Syncretized Shape Model (SSM) and Syncretized Texture Model (STM), respectively. Compared with other related works, our proposed algorithm has the following characteristics: 1) the inferred shape and texture are more realistic owing to the constraints and co-enhancement among the multiple images; 2) it is fully automatic, without any user interaction; and 3) the shape and pose parameter estimation is efficient via EM approach and unit quaternion based pose representation, and is also robust as a result of the dynamic correspondence approach. The experimental results show the effectiveness of our proposed algorithm for 3D face reconstruction.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122983618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Paving the Last Mile for Multi-Channel Multimedia Presentation Generation 为多通道多媒体演示生成铺路最后一公里
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.58
A. Scherp, Susanne CJ Boll
{"title":"Paving the Last Mile for Multi-Channel Multimedia Presentation Generation","authors":"A. Scherp, Susanne CJ Boll","doi":"10.1109/MMMC.2005.58","DOIUrl":"https://doi.org/10.1109/MMMC.2005.58","url":null,"abstract":"Users of multimedia applications today are equipped with a variety of different (mobile) devices that each come with different operating systems, memory and CPU capabilities, network connections, and also different software such as multimedia players. To be able to efficiently deliver appealing multimedia presentations to all the users, we need to overcome the last mile in multimedia presentation delivery and meet the different requirements at the end user’s site. Consequently, one needs to provide multi-channel multimedia presentation generation such that all different users can get and use it in their individual device configuration. With our approach, we aim at developing an abstract multimedia content model that embeds the central characteristics of today’s multimedia presentation formats: the definition of the temporal and spatial layout as well as the interaction possibilities. The abstract model allows to be easily transformed to different multimedia presentation formats on different devices and by this serve different output channels. We present our abstract multimedia model and transformation process that allows to compose and to generate suitable multimedia presentations for each different channel such as a SMIL 2.0 presentation created for rendering on a PC or a SVG Tiny presentation designed for a mobile device. With our implementation, application developers can efficiently realize on-the-fly multichannel generation of multimedia content.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129104782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A Cooperative Image Editing Tool over Mobile Phones 一个合作的图像编辑工具在移动电话
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.6
Jian Zhai, Qing Li, Xiang Li, Wenyin Liu
{"title":"A Cooperative Image Editing Tool over Mobile Phones","authors":"Jian Zhai, Qing Li, Xiang Li, Wenyin Liu","doi":"10.1109/MMMC.2005.6","DOIUrl":"https://doi.org/10.1109/MMMC.2005.6","url":null,"abstract":"Nowadays, color screen, powerful computing processors, and high speed data communication protocols are supported by more and more mobile phones. Through these new technologies, it is possible for mobile phone users to download, view, and even edit images on their mobile phones. However, this poses a new challenge for mobile software developers if they want to allow several users to edit on the same image. Traditional methods (e.g., white board method) cannot be used under mobile environment due to its high communication cost. In this paper, we propose a cooperative editing tool based on an asynchronous method to greatly save the communication cost. The tool is successfully used as a plug-in in our Mobile Information and Resource Exchange System (MIRES).","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116186644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Novel Approach of 3D Reconstruction of Human Face Using Monocular Camera 一种基于单目摄像机的人脸三维重建新方法
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.13
B. Yip, Jesse S. Jin
{"title":"A Novel Approach of 3D Reconstruction of Human Face Using Monocular Camera","authors":"B. Yip, Jesse S. Jin","doi":"10.1109/MMMC.2005.13","DOIUrl":"https://doi.org/10.1109/MMMC.2005.13","url":null,"abstract":"Three-dimensional model acquisition of an object is essential in many multimedia applications. Constructing three-dimensional models of objects from two-dimensional images is an old problem in the area of computer vision. There are many publications and our approach is specifically designed for constructing the depth map of a human face on frontal view, based on the head movement in a monocular setting. Our approach does not require a predefined face model. In this paper, along with the front view image of the user, three additional images with various head movement are captured, and the head pose is then calculated. The depth map is calculated through a triangular mesh, which the nodes on the mesh are the feature points that we calculate the depth with. Through image registration process, the feature points on the front view image are mapped to the other three images. Based on the head pose and the newly mapped coordinate, we could calculate the depth of the feature point. The depth results calculated from each of the three images are combined together to find the final depth value. Our approach does not require additional hardware, or predefined models. The result is not as good as we wished, but suggests avenues for future.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126199636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Performance Study of Gabor Filters and Rotation Invariant Gabor Filters Gabor滤波器及旋转不变Gabor滤波器的性能研究
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.59
C. Ng, Guojun Lu, Dengsheng Zhang
{"title":"Performance Study of Gabor Filters and Rotation Invariant Gabor Filters","authors":"C. Ng, Guojun Lu, Dengsheng Zhang","doi":"10.1109/MMMC.2005.59","DOIUrl":"https://doi.org/10.1109/MMMC.2005.59","url":null,"abstract":"Gabor filters have been proven to be very useful for texture retrieval and are widely adopted (1, 2, 3, 4, 5, 6). However, the original Gabor texture features are rotation variant. Recently, Zhang et al proposed rotation normalization using circular shift. They have shown the proposed rotation normalization is effective through some examples, but did not comprehensively study its performance. The purpose of this paper is to study the performance of the rotation normalization on a good size texture database. Our experimental results show that the proposed rotation normalization is effective in retrieving rotated textures and has some adverse effect on retrieving non-rotated texture.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121974922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A Generic Framework for Semantic Sports Video Analysis Using Dynamic Bayesian Networks 基于动态贝叶斯网络的语义体育视频分析通用框架
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.9
Fei Wang, Yu-Fei Ma, HongJiang Zhang, Jintao Li
{"title":"A Generic Framework for Semantic Sports Video Analysis Using Dynamic Bayesian Networks","authors":"Fei Wang, Yu-Fei Ma, HongJiang Zhang, Jintao Li","doi":"10.1109/MMMC.2005.9","DOIUrl":"https://doi.org/10.1109/MMMC.2005.9","url":null,"abstract":"Automatic detection of semantic events in sport videos is a challenging task. In this paper, we propose a multimodal multilayer statistical inference framework for semantic sports video analysis using Dynamic Bayesian Networks (DBNs). Based on this framework, three instances including factorial hierarchical hidden Markov model (FHHMM), coupled hierarchical hidden Markov model (CHHMM), and product hierarchical hidden Markov model (PHHMM), are constructed and compared. Play-break detection in soccer videos is used as a testbed with hierarchical hidden Markov model (HHMM) as a baseline. Experimental results indicate the superior capability of the PHHMM, because it not only effectively models dynamic interactions between different modalities, but also sufficiently utilizes context constraints in multilayer structures.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129800476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
A Model for Meeting Content Storage and Retrieval 会议内容存储与检索模型
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.12
S. Luz, M. Masoodian
{"title":"A Model for Meeting Content Storage and Retrieval","authors":"S. Luz, M. Masoodian","doi":"10.1109/MMMC.2005.12","DOIUrl":"https://doi.org/10.1109/MMMC.2005.12","url":null,"abstract":"This paper presents a model for storage of remote Internet-based multimedia meetings and information retrieval from textual and time-based content. The model builds on a theory of content mapping that exploits temporal and contextual relationships between media streams. Two prototypes are presented which illustrate the application of the model to a virtual meeting environment, and to a system for visualisation of meeting records on mobile devices. Implications of the proposed content mapping model with respect to interface design and non-linear browsing of time-based media are also discussed.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121670944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A Graphical User Interface for Automatic Facial Texture Mapping Based on Orthogonal Photos 基于正交照片的人脸纹理自动映射图形用户界面
11th International Multimedia Modelling Conference Pub Date : 2005-01-12 DOI: 10.1109/MMMC.2005.10
S. Ferradal, J. Gómez
{"title":"A Graphical User Interface for Automatic Facial Texture Mapping Based on Orthogonal Photos","authors":"S. Ferradal, J. Gómez","doi":"10.1109/MMMC.2005.10","DOIUrl":"https://doi.org/10.1109/MMMC.2005.10","url":null,"abstract":"In this paper, a Graphical User Interface for the automatic generation of a facial texture mapping from three orthogonal photos is presented. The proposed method is based on the composition of the two side views (previously deformed) and the front view. The deformation is performed using a set of particular FDP (Facial Definition Points) of the MPEG-4 standard for multimedia applications. To smooth out the boundaries caused by the merging of the orthogonal views, a wavelet-based multiresolution filtering technique is employed. This work is the first stage in the development of a toolbox for automatic facial cloning.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"256 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132811674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信