{"title":"Region-Based Image Retrieval with High-Level Semantic Color Names","authors":"Y. Liu, Dengsheng Zhang, Guojun Lu, Wei-Ying Ma","doi":"10.1109/MMMC.2005.62","DOIUrl":"https://doi.org/10.1109/MMMC.2005.62","url":null,"abstract":"The performance of traditional content-based image retrieval systems falls far short of users’ expectations due to the ‘semantic gap’ between low-level visual features and the richness of human semantics. In an attempt to reduce this ‘semantic gap’, this paper introduces a region-based image retrieval system with high-level semantic color names. In this system, database images are segmented into color-texture homogeneous regions. For each region, we define a color name as used in daily life. In the retrieval process, images containing regions of the same color name as the query are selected as candidates. These candidate images are further ranked based on their color and texture features. In this way, the system reduces the ‘semantic gap’ between numerical image features and the rich semantics in the user’s mind. Experimental results show that the proposed system provides promising retrieval results with only a few features.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"152 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133259390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallel Image Matrix Compression for Face Recognition","authors":"Dong Xu, Shuicheng Yan, Lei Zhang, Mingjing Li, Wei-Ying Ma, Zhengkai Liu, HongJiang Zhang","doi":"10.1109/MMMC.2005.57","DOIUrl":"https://doi.org/10.1109/MMMC.2005.57","url":null,"abstract":"The canonical face recognition algorithms Eigenface and Fisherface are both based on one-dimensional vector representation. However, with high feature dimensionality and small training sets, face recognition often suffers from the curse of dimensionality and the small sample size problem. Recent research [4] shows that face recognition based on direct 2D matrix representation, i.e. 2DPCA, obtains better performance than that based on traditional vector representation. However, three questions are left unresolved in the 2DPCA algorithm: 1) what is the meaning of the eigenvalues and eigenvectors of the covariance matrix in 2DPCA; 2) why can 2DPCA outperform Eigenface; and 3) how to directly reduce the dimension after 2DPCA. In this paper, we analyze 2DPCA from a different view and prove that 2DPCA is actually a \"localized\" PCA with each row vector of an image as an object. With this explanation, we show that the intrinsic reason 2DPCA can outperform Eigenface is that fewer feature dimensions and more samples are used in 2DPCA than in Eigenface. To further reduce the dimension after 2DPCA, a two-stage strategy, namely parallel image matrix compression (PIMC), is proposed to compress the image matrix redundancy that exists among row vectors and column vectors. Extensive experimental results demonstrate that PIMC is superior to 2DPCA and Eigenface, and that PIMC+LDA outperforms 2DPCA+LDA and Fisherface.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"7 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114010479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Color Time Petri Net for Interactive Adaptive Multimedia Objects","authors":"A. Gomaa, N. Adam, V. Atluri","doi":"10.1109/MMMC.2005.26","DOIUrl":"https://doi.org/10.1109/MMMC.2005.26","url":null,"abstract":"A composite multimedia object (cmo) comprises different media components such as text, video, audio and image, with a variety of constraints that must be adhered to. The constraints are 1) rendering relationships, which comprise the temporal and spatial constraints between different components; 2) behavioral requirements, which include the security and fidelity constraints on each component; and 3) user interactions on a set of related media components. Different users have different capabilities (e.g. age), characteristics (e.g. monitor size) and credentials (e.g. subscription to a service). Our objective is to author an interactive adaptive cmo that renders itself correctly for different users. Therefore, it is important to guarantee the consistency of the cmo specifications in all possible scenarios. In this paper, we include user interaction, together with temporal and spatio-temporal behavior, in the specification of the adaptive cmo. We then check the consistency of the user interaction specifications by transforming them into a color time Petri net model. We perform a reachability analysis on the Petri net to identify inconsistencies, and then resolve the identified inconsistencies to obtain a consistent Petri net. A consistent Petri net represents an error-free interactive cmo that can adapt to different users, by guaranteeing that linked user interactions are reachable for all eligible users.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115622448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sports Video Mining with Mosaic","authors":"Tao Mei, Yu-Fei Ma, He-Qin Zhou, Wei-Ying Ma, HongJiang Zhang","doi":"10.1109/MMMC.2005.68","DOIUrl":"https://doi.org/10.1109/MMMC.2005.68","url":null,"abstract":"Video is an information-intensive medium with much redundancy. Therefore, it is desirable to be able to mine the structure or semantics of video data for efficient browsing, summarization and highlight extraction. In this paper, we propose a generic approach to mining key events as well as structure for sports video analysis. A mosaic is generated for each shot as the representative image of the shot content. Based on the mosaic, sports video is mined both with and without prior knowledge. Without prior knowledge, our system can locate plays by discriminating segments without essential content, such as breaks. If prior knowledge is available, the key events in plays are detected using robust features extracted from the mosaic. Experimental results have demonstrated the effectiveness and robustness of this sports video mining approach.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126552851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Mining and Retrieval Using Hierarchical Support Vector Machines","authors":"R. Brown, Binh Pham","doi":"10.1109/MMMC.2005.48","DOIUrl":"https://doi.org/10.1109/MMMC.2005.48","url":null,"abstract":"For some time now, image retrieval approaches have been developed that use low-level features, such as colour histograms, edge distributions and texture measures. What has been lacking in image retrieval approaches is the development of general methods for more structured object recognition. This paper describes in detail a general hierarchical image classifier approach, and illustrates the ease with which it can be trained to find objects in a scene. To further illustrate the wide capabilities of this approach, results from its application to particle picking in biology and Vietnamese art image retrieval are listed.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"598 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132657271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cyber Composer: Hand Gesture-Driven Intelligent Music Composition and Generation","authors":"H. Ip, K. Law, Belton Kwong","doi":"10.1109/MMMC.2005.32","DOIUrl":"https://doi.org/10.1109/MMMC.2005.32","url":null,"abstract":"Cyber Composer is a novel and interactive cyber instrument that enables both musicians and music laypersons to dynamically control the tonality and the melody of the music that they generate/compose through hand motion and gestures. Cyber Composer generates music according to hand motions and gestures of the users in the absence of real musical instruments. Music theories are embedded in the design so that melody flow and musical expressions like the pitch, rhythm and volume of the melody can be controlled and generated in real-time by wearing a pair of motion-sensing gloves. Also central to the design is the mapping of the hand motions and gestures to musical expressions that is intuitive and requires minimal training. Cyber Composer is expected to find applications in the fields of performance, composing, entertainment, education as well as psychotherapy.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127912481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analyzing Tennis Tactics from Broadcasting Tennis Video Clips","authors":"Jenny R. Wang, N. Parameswaran","doi":"10.1109/MMMC.2005.20","DOIUrl":"https://doi.org/10.1109/MMMC.2005.20","url":null,"abstract":"This paper attempts to classify tennis games into 58 winning patterns for training purposes. It is based on tracking ball movement in broadcast tennis video. Trajectory and landing position are used as the basic features for classification. We use an improved Bayesian network to classify the landing positions of different patterns. Intelligent agents are used to combine trajectories and landing positions, since the two features are in different dimensions. Semantic labels are assigned after classification. The aim of the analysis is to provide a browsing tool for coaches or other personnel to retrieve tennis video clips.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126858636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Interactive Camera Planning System for Automatic Cinematographer","authors":"Tsai-Yen Li, X. Xiao","doi":"10.1109/MMMC.2005.19","DOIUrl":"https://doi.org/10.1109/MMMC.2005.19","url":null,"abstract":"Currently most systems capable of performing intelligent camera control use cinematographic idioms or a constraint satisfaction mechanism to determine a sequence of camera configurations for a given animation script. However, an automated cinematography system cannot be made practical without taking idiosyncrasy and the distinct role of each member in a filmmaking team into account. In this paper, we propose an interactive virtual cinematographer model imitating the key functions of a real filmmaking team consisting of three modules: director, photographer, and editor. The system uses parameterized cinematographic idioms in the three modules to determine the best camera configurations for an animation script. The system allows a user to interact with the virtual cinematographer to specify stylistic preferences, which can be carried over to other animation scripts.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126875362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recognition of Enhanced Images","authors":"Khanh Vu, K. Hua, N. Hiransakolwong, Sirikunya Nilpanich","doi":"10.1109/MMMC.2005.61","DOIUrl":"https://doi.org/10.1109/MMMC.2005.61","url":null,"abstract":"Image enhancement such as adjusting brightness and contrast is central to improving human visualization of image content. Images of the desired enhanced quality facilitate analysis, interpretation, classification, information exchange, indexing and retrieval. The adjustment process, guided by diverse enhancement objectives and subjective human judgment, often produces various versions of the same image. Although content is preserved under these operations, most existing techniques treat enhanced images as new images because their features differ widely. This leads to difficulties in recognizing and retrieving images across application domains and user interests. To allow unrestricted enhancement flexibility, accurate identification of images and their enhanced versions is therefore essential. In this paper, we introduce a measure that theoretically guarantees the identification of all enhanced images originating from a single image. In our approach, images are represented by points in a multidimensional intensity-based space. We show that points representing images of the same content are confined to a well-defined area that can be identified by a formula devised for this purpose. We evaluated our technique on large sets of images from various categories, including medical, satellite, texture and color images, as well as scanned documents. The proposed measure yields an actual recognition rate approaching 100% in all image categories, outperforming other well-known techniques by a wide margin. Our analysis can at the same time serve as a basis for determining the minimum criterion a similarity measure should satisfy. We also discuss how to apply the formula as a similarity measure in existing systems to support general image retrieval.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"58 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126972877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video Snapshot: A Bird View of Video Sequence","authors":"Yu-Fei Ma, HongJiang Zhang","doi":"10.1109/MMMC.2005.71","DOIUrl":"https://doi.org/10.1109/MMMC.2005.71","url":null,"abstract":"Video is an information-intensive medium with much redundancy. Therefore, it is desirable to be able to quickly browse video content or deliver videos over limited bandwidth, which can be achieved by effective video summarization. In this paper, we present a novel pictorial video summary, called Video Snapshot, which is a bird's-eye view of a video that enables viewers to grasp its main content at a glance. Moreover, a comprehensive scoring scheme for content filtering, called PRID (Pleasurable, Representative, Informative and Distinctive), and an optimized video visualization algorithm are also proposed. The encouraging results indicate that many potential digital video applications may be leveraged by Video Snapshot, such as video browsing, retrieval and delivery in heterogeneous computing and networking environments.","PeriodicalId":121228,"journal":{"name":"11th International Multimedia Modelling Conference","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124251098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}