2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR): Latest Publications

GARGI: Selecting Gaze-Aware Representative Group Image from a Live Photo
Omkar N. Kulkarni, Shashank Arora, P. Atrey
DOI: 10.1109/MIPR54900.2022.00027
Abstract: The number of photos, especially group photos in live mode, has increased tremendously in today's world. Selecting a representative image in a live photo that preserves the aesthetic quality is a challenging task. In this paper, we propose a method to select a Gaze-Aware Representative Group Image, called GARGI, that considers the uniformity, or consequently the deviation, of the people's gaze in live-mode group photos to make it aesthetically pleasing. We tested this method on our own live-mode group image dataset. We argue that the inbuilt representative image selection mechanism in an Apple iPhone does not consider the subject's gaze, especially in a group image. GARGI considers the deviation of each subject's gaze with respect to their expected gaze direction and determines an aesthetically better representative image with the least amount of gaze deviation across all subjects. The results presented in the paper justify this claim and can pave the way toward a standard keyframe selection mechanism for live photos, burst-mode shots, or even videos that include human subjects.
Cited by: 1
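A minimal sketch of the selection rule the abstract describes: score each frame of a live photo by the total angular deviation between each subject's estimated gaze and their expected gaze direction, then pick the frame with the lowest total. Gaze estimation itself is assumed upstream, and the scoring function is an illustrative reading of the paper, not its exact formulation.

```python
import numpy as np

def gaze_deviation(gaze, expected):
    """Angle in radians between a gaze vector and the expected direction."""
    cos = np.dot(gaze, expected) / (np.linalg.norm(gaze) * np.linalg.norm(expected))
    return np.arccos(np.clip(cos, -1.0, 1.0))

def select_representative_frame(frames_gazes, expected_dirs):
    """frames_gazes: (n_frames, n_subjects, 3) gaze vectors per frame.
    expected_dirs: (n_subjects, 3) expected gaze direction per subject."""
    scores = [sum(gaze_deviation(g, e) for g, e in zip(frame, expected_dirs))
              for frame in frames_gazes]
    return int(np.argmin(scores))  # frame with the least total gaze deviation

# Example: 3 frames, 2 subjects, both expected to look straight at the camera (+z).
rng = np.random.default_rng(0)
frames = rng.normal(size=(3, 2, 3))
expected = np.tile([0.0, 0.0, 1.0], (2, 1))
print(select_representative_frame(frames, expected))
```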
Creative Improvised Interaction with Generative Musical Systems
S. Dubnov, G. Assayag, V. Gokul
DOI: 10.1109/MIPR54900.2022.00028
Abstract: In this paper we survey methods for control of and creative interaction with pre-trained generative models for audio and music. By using reduced (lossy) encoding and symbolization steps, we are able to examine the level of information passing between the environment (the musician) and the agent (machine improvisation). We further use the concept of music information dynamics to find an optimal symbolization in terms of a predictive information measure. The surveyed methods and strategies, and their implications for creative interaction with the machine, are discussed in the framework of musical improvisation.
Cited by: 0
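A hedged sketch of the symbolization idea: music information dynamics selects a lossy symbolization of the signal by a predictive information criterion. Here a crude first-order proxy is used, the mutual information between consecutive symbols under k-means quantizations of different codebook sizes; the estimator, the candidate sizes, and k-means itself are illustrative assumptions, and a real system would also trade codebook size (rate) against prediction.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import mutual_info_score

def symbolize(features, k):
    """Lossy encoding: quantize continuous features into k symbols."""
    return KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(features)

def predictive_information(symbols):
    # I(past; future) approximated by I(s_t; s_{t+1})
    return mutual_info_score(symbols[:-1], symbols[1:])

# Toy feature sequence with temporal structure (a random walk).
features = np.cumsum(np.random.default_rng(1).normal(size=(500, 4)), axis=0)
for k in (2, 4, 8, 16):
    print(k, predictive_information(symbolize(features, k)))
```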
Rate-Adaptive Streaming of 360-Degree Videos with Head-Motion-Aware Viewport Margins
Mehmet N. Akcay, Burak Kara, A. Begen, Saba Ahsan, I. Curcio, Emre B. Aksu
DOI: 10.1109/MIPR54900.2022.00056
Abstract: Efficient use of available bandwidth is vital when streaming 360-degree videos, as users rarely have enough bandwidth for a pleasant experience. A promising solution is the combination of viewport-dependent streaming using tiled video and rate adaptation, where the goal is to spend most of the available bandwidth on the viewport tiles. However, head motions that change the viewport tiles briefly cause low-quality rendering until the new tiles can be replaced with high-quality versions. Previously, viewport margins (fixed regions around the viewport rendered at a medium quality) were proposed to make viewport changes less abrupt. Later, head-motion-aware viewport margins (HMAVMs) were implemented to further smooth the transitions at the expense of increased bandwidth consumption. In this paper, we manage the overall bandwidth cost of HMAVMs better by first developing a set of algorithms that trade off the quality of some viewport tiles and then making the margin selection part of the rate-adaptation algorithm.
Cited by: 2
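An illustrative sketch of the margin idea on a simplified 1-D ring of tile columns: the medium-quality margin around the viewport widens with head speed, and the bandwidth budget is spent viewport-first. The constants, the ring layout, and the greedy allocation are assumptions; the paper's actual algorithms additionally trade off the quality of individual viewport tiles and fold margin selection into the rate-adaptation logic.

```python
def ring_distance(a, b, n):
    d = abs(a - b) % n
    return min(d, n - d)

def allocate_tiles(n_tiles, viewport, head_speed, budget, rates):
    """viewport: set of tile indices; rates: {'high','med','low'} -> kbps per tile."""
    margin_w = min(3, 1 + int(head_speed / 30))  # wider margin under fast head motion
    margin = {t for t in range(n_tiles)
              if t not in viewport
              and min(ring_distance(t, v, n_tiles) for v in viewport) <= margin_w}
    plan, spent = {}, 0
    for tier, quality in ((viewport, 'high'), (margin, 'med')):
        for t in sorted(tier):
            q = quality if spent + rates[quality] <= budget else 'low'
            plan[t] = q
            spent += rates[q]
    for t in range(n_tiles):
        plan.setdefault(t, 'low')  # everything else streams at the base quality
    return plan

print(allocate_tiles(12, {0, 1}, head_speed=45, budget=6000,
                     rates={'high': 2000, 'med': 800, 'low': 200}))
```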
A Local-Global Metric Learning Method for Facial Expression Animation
Pengcheng Gao, Bin Huang, Jiayi Lyu, Haifeng Ma, Jian Xue
DOI: 10.1109/MIPR54900.2022.00046
Abstract: Facial expression animation plays an important role in character animation. The Expression Blendshape Model (EBM) provides a simple representation of various expressions through a linear combination of base blendshapes with expression coefficients. However, it is challenging to distinguish subtle expression changes. In this paper, we propose a method that combines local features and global features to regress the expression coefficients. Furthermore, local metric learning (LML) and global metric learning (GML) are proposed to enhance the recognizability of cross-individual expression features. Specifically, LML increases the feature distance of each blendshape that appears or disappears from the perspective of local representation, resulting in better capture of local appearance changes, while GML raises the feature distance between neutral and emotional expressions in the high-dimensional feature space from the global perspective. Experimental results and feature visualizations on the FEAFA dataset show the effectiveness of local and global metric learning.
Cited by: 0
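A hedged PyTorch sketch of the two losses as the abstract describes them: LML pushes apart local features of sample pairs whose blendshape activations differ (a blendshape "appears or disappears"), and GML pushes emotional embeddings away from neutral ones globally. The margin forms, thresholds, and pairwise scheme are assumptions, not the paper's exact losses.

```python
import torch
import torch.nn.functional as F

def lml_loss(local_feats, coeffs, margin=1.0, thresh=0.1):
    """local_feats: (B, D) per-patch features; coeffs: (B, K) blendshape coefficients."""
    loss, n = local_feats.new_zeros(()), 0
    for i in range(len(coeffs)):
        for j in range(i + 1, len(coeffs)):
            # a pair differs if some blendshape is active in one sample but not the other
            if ((coeffs[i] > thresh) != (coeffs[j] > thresh)).any():
                d = F.pairwise_distance(local_feats[i:i + 1], local_feats[j:j + 1])
                loss = loss + F.relu(margin - d).squeeze()
                n += 1
    return loss / max(n, 1)

def gml_loss(global_feats, is_neutral, margin=2.0):
    """Push the mean emotional embedding away from the mean neutral embedding.
    Assumes the batch contains both neutral and emotional samples."""
    neutral = global_feats[is_neutral].mean(dim=0)
    emotional = global_feats[~is_neutral].mean(dim=0)
    return F.relu(margin - torch.dist(neutral, emotional))
```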
Wasserstein Metric Attack on Person Re-identification
Astha Verma, A. Subramanyam, R. Shah
DOI: 10.1109/MIPR54900.2022.00049
Abstract: Adversarial attacks within the $l_p$ ball have recently been investigated against person re-identification (ReID) models. However, $l_p$ ball attacks disregard the geometry of the samples. To this end, the Wasserstein metric is a robust alternative, as the attack incorporates a cost matrix for pixel mass movement. In our work, we propose using the Wasserstein metric to perform adversarial attacks on the ReID system by projecting adversarial samples into the Wasserstein ball. We perform white-box and black-box attacks on state-of-the-art (SOTA) ReID models trained on the Market-1501, DukeMTMC-reID, and MSMT17 datasets. The performance of the best SOTA ReID models decreases drastically, from 90.2% to as low as 0.4%. Our model outperforms the SOTA attack methods by 17.2% in white-box attacks and 14.4% in black-box attacks. To the best of our knowledge, our work is the first to propose the Wasserstein metric for generating adversarial samples for the ReID task.
Cited by: 1
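A heavily hedged, simplified sketch of the idea: perturb an image to drive its ReID embedding away from the clean one while keeping the adversarial image close to the original as a pixel-mass distribution. The paper projects iterates into a Wasserstein ball; as a runnable stand-in, this sketch instead *penalizes* an entropic (Sinkhorn) optimal-transport distance between the two images. The model, image size, and every constant are illustrative assumptions.

```python
import torch

def sinkhorn_distance(a, b, coords, eps=1.0, iters=50):
    """Entropic OT between two normalized mass vectors over pixel coordinates."""
    C = torch.cdist(coords, coords)          # ground cost: pixel-to-pixel distance
    K = torch.exp(-C / eps)
    u = torch.ones_like(a)
    for _ in range(iters):
        u = a / (K @ (b / (K.t() @ u)))
    v = b / (K.t() @ u)
    P = u[:, None] * K * v[None, :]          # transport plan
    return (P * C).sum()

def wasserstein_attack(model, x, steps=20, lr=0.05, lam=10.0):
    h, w = x.shape[-2], x.shape[-1]
    coords = torch.stack(torch.meshgrid(torch.arange(h), torch.arange(w),
                                        indexing='ij'), -1).reshape(-1, 2).float()
    x_adv = x.clone().requires_grad_(True)
    feat_clean = model(x).detach()
    opt = torch.optim.Adam([x_adv], lr=lr)
    a = x.flatten() / x.sum()
    for _ in range(steps):
        b = x_adv.flatten().clamp(min=1e-8)
        b = b / b.sum()
        # maximize embedding drift, penalize mass transport from the clean image
        loss = -torch.norm(model(x_adv) - feat_clean) + lam * sinkhorn_distance(a, b, coords)
        opt.zero_grad(); loss.backward(); opt.step()
        with torch.no_grad():
            x_adv.clamp_(0, 1)
    return x_adv.detach()
```

For a toy run, `model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(64, 8))` with `x = torch.rand(1, 1, 8, 8)` suffices; a faithful implementation would replace the penalty with a true projection onto the Wasserstein ball.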
Personalized Fashion Sequential Recommendation with Visual Feature Based on Conditional Hierarchical VAE
Keiichi Suekane, Ryoichi Osawa, Aozora Inagaki, Taiga Matsui, Tomohiro Tanabe, Keita Ishikawa, T. Takagi
DOI: 10.1109/MIPR54900.2022.00071
Abstract: With the growth of online shopping services, there has been much research on fashion item recommendation. Unlike standard recommendation systems, a recommendation for fashion items needs to take into account the context of the item IDs in the user behavior and that of fashion-specific visual features such as color and design. In this study, we propose the conditional hierarchical variational auto-encoder (CHVAE) for extracting fashion-specific visual features, and construct a fashion item recommendation system based on it. CHVAE is an extension of the VAE that enables conditional and hierarchical learning. It can capture the continuous latent space of color and design using item images and labels, and extract visual features for fashion recommendations. In our experiments, we show that the proposed method outperforms an extensive list of state-of-the-art sequential recommendation models and achieves the same or better performance than human stylists.
Cited by: 2
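A minimal PyTorch skeleton of a conditional, two-level hierarchical VAE in the spirit of CHVAE: a top latent for coarse factors (e.g., color), a bottom latent that refines them (e.g., design), both conditioned on the item label. Layer sizes and the exact factorization are assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class CHVAESketch(nn.Module):
    def __init__(self, x_dim=512, y_dim=10, z1_dim=32, z2_dim=16, h=256):
        super().__init__()
        self.enc1 = nn.Sequential(nn.Linear(x_dim + y_dim, h), nn.ReLU(), nn.Linear(h, 2 * z1_dim))
        self.enc2 = nn.Sequential(nn.Linear(x_dim + z1_dim, h), nn.ReLU(), nn.Linear(h, 2 * z2_dim))
        self.dec = nn.Sequential(nn.Linear(z1_dim + z2_dim + y_dim, h), nn.ReLU(), nn.Linear(h, x_dim))

    @staticmethod
    def reparam(stats):
        mu, logvar = stats.chunk(2, dim=-1)
        return mu + torch.randn_like(mu) * (0.5 * logvar).exp(), mu, logvar

    def forward(self, x, y):
        z1, mu1, lv1 = self.reparam(self.enc1(torch.cat([x, y], -1)))    # coarse level
        z2, mu2, lv2 = self.reparam(self.enc2(torch.cat([x, z1], -1)))   # refinement level
        recon = self.dec(torch.cat([z1, z2, y], -1))
        kl = sum(-0.5 * (1 + lv - mu.pow(2) - lv.exp()).sum(-1).mean()
                 for mu, lv in ((mu1, lv1), (mu2, lv2)))
        return recon, kl

model = CHVAESketch()
x = torch.randn(4, 512)                                    # item image features
y = nn.functional.one_hot(torch.tensor([1, 2, 3, 4]), 10).float()  # item labels
recon, kl = model(x, y)
loss = nn.functional.mse_loss(recon, x) + 0.1 * kl         # ELBO-style objective
```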
Fast VVC Intra Coding by Skipping Redundant Coding Block Structures and Unnecessary Directional Partition
Ziheng Zhang, Chang-Hong Fu, Kai Xie, Hong Hong, Guan-Ming Su
DOI: 10.1109/MIPR54900.2022.00022
Abstract: The Joint Video Exploration Team (JVET) released the latest video coding standard, Versatile Video Coding (H.266/VVC), in 2020. VVC adopts the quadtree with nested multi-type tree (QTMT) coding block structure, which brings a huge computational burden. To solve this problem, we design a fast intra coding algorithm based on skipping redundant coding block structures and unnecessary directional partitions in VVC. First, we analyze the coding block partitioning of I frames in VVC and find that some redundant partitions remain; based on this, a redundant partition skipping (RPS) scheme is designed. Second, we aim to skip unnecessary directional partitions (DPS) in VVC, inspired by the correlation between the optimal coding unit (CU) partition and other directional information such as the image texture direction, the directional intra prediction modes (IPM), and the intra sub-partitions (ISP) mode. Compared with VTM-6.0, the proposed algorithms achieve a 26.46% time saving with only a 0.45% BDBR increase on average.
Cited by: 1
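An illustrative sketch of the DPS intuition: the dominant gradient direction of a CU hints at which split directions are worth evaluating, so a strongly horizontal texture (dominant vertical gradients) lets the encoder skip vertical binary/ternary splits. The threshold and decision rule below are assumptions, not the paper's criteria, and the RPS scheme is a separate mechanism.

```python
import numpy as np
from scipy.ndimage import sobel

def candidate_split_directions(cu, ratio_thresh=2.0):
    gx = sobel(cu.astype(float), axis=1)   # horizontal intensity changes
    gy = sobel(cu.astype(float), axis=0)   # vertical intensity changes
    ex, ey = np.abs(gx).sum() + 1e-9, np.abs(gy).sum() + 1e-9
    if ey / ex > ratio_thresh:
        return ['horizontal']              # horizontal texture: skip vertical splits
    if ex / ey > ratio_thresh:
        return ['vertical']                # vertical texture: skip horizontal splits
    return ['horizontal', 'vertical']      # ambiguous: evaluate both directions

cu = np.tile(np.arange(32), (32, 1)).T     # smooth vertical ramp (horizontal stripes)
print(candidate_split_directions(cu))      # -> ['horizontal']
```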
ExpressionHop: A Lightweight Human Facial Expression Classifier
Chengwei Wei, C. J. Kuo, R. L. Testa, Ariane Machado-Lima, Fátima L. S. Nunes
DOI: 10.1109/MIPR54900.2022.00042
Abstract: A lightweight human facial expression recognition (FER) solution aimed at mobile/edge applications is proposed in this work. The solution, called ExpressionHop, consists of four modules: 1) cropping out patches based on facial landmarks, 2) applying filter banks to each patch to generate a rich set of joint spatial-spectral features, 3) conducting the discriminant feature test (DFT) to select features with higher discriminant power, and 4) performing the final classification task with a classifier. We conduct performance benchmarking of ExpressionHop against traditional and deep learning methods on several commonly used FER datasets such as JAFFE, CK+, and KDEF. Experimental results show that ExpressionHop achieves comparable or better classification accuracy, yet its model has only 30K parameters, significantly fewer than those of deep learning methods.
Cited by: 3
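A hedged sketch of the four-stage pipeline shape. The paper's filter banks and discriminant feature test come from the green-learning toolchain; as runnable stand-ins, this sketch uses random filters and ANOVA F-score selection (sklearn), which mimic the pipeline's structure rather than its exact operators. Landmarks and labels here are synthetic placeholders.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def crop_patches(img, landmarks, size=8):
    """Stage 1: cut a small patch around each facial landmark."""
    half = size // 2
    return [img[y - half:y + half, x - half:x + half] for (y, x) in landmarks]

def filter_bank_features(patches, filters):
    """Stage 2: per-patch responses to a bank of filters (mean |response|)."""
    return np.array([[np.abs(p * f).mean() for f in filters] for p in patches]).ravel()

rng = np.random.default_rng(0)
filters = [rng.normal(size=(8, 8)) for _ in range(6)]
landmarks = [(20, 20), (20, 44), (44, 32)]      # eyes and mouth, assumed positions
X = np.stack([filter_bank_features(crop_patches(rng.normal(size=(64, 64)), landmarks),
                                   filters) for _ in range(40)])
y = rng.integers(0, 2, size=40)                 # toy expression labels
# Stages 3-4: discriminant feature selection, then a lightweight classifier.
clf = make_pipeline(SelectKBest(f_classif, k=9), LogisticRegression(max_iter=500))
clf.fit(X, y)
```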
Interpretable Learning-Based Multi-Modal Hashing Analysis for Multi-View Feature Representation Learning
Lei Gao, L. Guan
DOI: 10.1109/MIPR54900.2022.00016
Abstract: In this work, an interpretable learning-based multi-modal hashing analysis (ILMMHA) model is proposed with application to multi-view feature representation learning. In the proposed model, a cascade network structure is first utilized to reveal the intrinsically semantic representation of the input variables. Then, a multi-modal hashing (MMH) method is integrated with the explored semantic representation, generating an interpretable learning-based model for multi-view feature representation. Since MMH is capable of measuring semantic similarity across multiple variables jointly, it provides a natural link between the explored intrinsically semantic representation and its similarity across multi-modal data/information. Benefiting from the integration of the cascade structure and MMH, the ILMMHA model leads to a new multi-view feature representation of high quality. To demonstrate the effectiveness and generic nature of the ILMMHA model, we conduct experiments on cross-modal audio-visual emotion recognition and text-image recognition tasks. Experimental results demonstrate the superiority of the proposed model for multi-view feature representation learning.
Cited by: 1
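A hedged stand-in for the multi-modal hashing step: project two modalities into a shared space (here via CCA, an assumption, not the paper's ILMMHA model) and binarize with sign() so that Hamming distance between the codes approximates cross-modal semantic similarity. The synthetic data simulates two views of a shared latent factor.

```python
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
shared = rng.normal(size=(200, 8))                        # latent semantics
audio = shared @ rng.normal(size=(8, 32)) + 0.1 * rng.normal(size=(200, 32))
visual = shared @ rng.normal(size=(8, 48)) + 0.1 * rng.normal(size=(200, 48))

cca = CCA(n_components=8).fit(audio, visual)
za, zv = cca.transform(audio, visual)
ha, hv = np.sign(za), np.sign(zv)                         # binary hash codes
hamming = (ha[:, None, :] != hv[None, :, :]).mean(-1)     # (200, 200) code distances
print("matched-pair mean Hamming:", hamming.diagonal().mean())
print("mismatched mean Hamming:", hamming[~np.eye(200, dtype=bool)].mean())
```

Matched audio-visual pairs should show a clearly smaller mean Hamming distance than mismatched pairs, which is the property a hashing-based retrieval step relies on.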
Machine-Learning Based High Efficiency Rate Control for AV1
Yi Chen, Yunhao Mao, Shiqi Wang, Xianguo Zhang, S. Kwong
DOI: 10.1109/MIPR54900.2022.00019
Abstract: Recent years have witnessed an increasing demand for video coding technologies, which have been continuously developed to meet various requirements in video-related applications. Developed by the Alliance for Open Media (AOM), AOMedia Video 1 (AV1) is an open-source and royalty-free standard. Herein, we achieve high efficiency rate control for AV1 based on a machine-learning model, which establishes the rate-quantization relationship in a data-driven manner. More specifically, Support Vector Regression (SVR) is used for rate model parameter estimation. The model is trained using sufficient training data and incorporated into the encoder. Compared to the default rate control scheme in AV1, experimental results show that 2.01% of the bitrate can be saved with a tolerable bitrate error.
Cited by: 0
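A hedged sketch of a data-driven rate-quantization model: fit an SVR that predicts bits from (content features, quantizer), then pick the quantizer whose predicted rate best matches the target. The feature choice and the synthetic R-Q data are illustrative assumptions; the paper's exact parameterization is not reproduced here.

```python
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
complexity = rng.uniform(1, 10, 300)                    # e.g., spatio-temporal activity
q = rng.uniform(20, 60, 300)                            # quantizer value
bits = 5000 * complexity / q + rng.normal(0, 50, 300)   # toy rate-quantization behavior
X = np.column_stack([complexity, q])

model = SVR(C=100.0, epsilon=5.0).fit(X, bits)          # data-driven R-Q relationship

def pick_quantizer(model, complexity, target_bits, q_grid=np.linspace(20, 60, 81)):
    """Invert the learned model: choose q whose predicted rate is nearest the target."""
    preds = model.predict(np.column_stack([np.full_like(q_grid, complexity), q_grid]))
    return q_grid[np.argmin(np.abs(preds - target_bits))]

print(pick_quantizer(model, complexity=6.0, target_bits=800.0))
```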