2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)最新文献_第2页

Text Style Transfer With Decorative Elements 文字风格转移与装饰元素

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00062

Yuting Ma, Fan Tang, Weiming Dong, Changsheng Xu

引用次数: 0

Supplementing Omitted Named Entities in Cooking Procedural Text with Attached Images 用附加图像补充烹饪过程文本中遗漏的命名实体

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00037

Yixin Zhang, Yoko Yamakata, Keishi Tajima

{"title":"Supplementing Omitted Named Entities in Cooking Procedural Text with Attached Images","authors":"Yixin Zhang, Yoko Yamakata, Keishi Tajima","doi":"10.1109/MIPR51284.2021.00037","DOIUrl":"https://doi.org/10.1109/MIPR51284.2021.00037","url":null,"abstract":"In this research, we aim at supplementing named entities, such as food, omitted in the procedural text of recipe data. It helps users understand the recipe and is also necessary for the machine to understand the recipe data automatically. The contribution of this research is as follows. (1) We construct a dataset of Chinese recipes consisting of 12,548 recipes. To detect sentences in which food entities are omitted, we label named entities such as food, tool, and cooking actions in the procedural text by using the automatic recipe named entity recognition method. (2) We propose a method of recognizing food from the attached images. A procedural text of recipe data is often associated with an image, and the attached image often contains the food even when it is omitted in the procedural text. Tool entities in images in recipe data can be identified with high accuracy by conventional general object recognition techniques. On the other hand, the general object recognition methods in the literature, which assume that the properties of an object are constant, perform not well for food in recipe image data because food states change during cooking procedures. To solve this problem, we propose a method of obtaining food entity candidates from other steps that are similar to the target step, both in sentence similarity and image feature similarity. Among all the 246,195 procedural steps in our dataset, there are 16,593 steps in which the food entity is omitted in the procedural text. Our method is applied to supplement the food entities in these steps and achieves the accuracy of 67.55%.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130162051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Practice-Oriented Real-time Person Occurrence Search System 面向实践的实时人员发生搜索系统

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00040

S. Yamazaki, Hui Lam Ong, Jianquan Liu, Wei Jian Peh, Hong Yen Ong, Qinyu Huang, Xinlai Jiang

{"title":"Practice-Oriented Real-time Person Occurrence Search System","authors":"S. Yamazaki, Hui Lam Ong, Jianquan Liu, Wei Jian Peh, Hong Yen Ong, Qinyu Huang, Xinlai Jiang","doi":"10.1109/MIPR51284.2021.00040","DOIUrl":"https://doi.org/10.1109/MIPR51284.2021.00040","url":null,"abstract":"Face recognition is a potential technology to realize Person Occurrence Search (POS) application which retrieves all occurrences of a target person over multiple cameras. From the industry perspective, such a POS application requires a practice-oriented system that can respond to search requests in seconds, return search results nearly without false positives, and handle the variations of face angles and illumination in camera views. In this paper, we demonstrate a real-time person occurrence search system that adopts person re-identification for person occurrence tracking to achieve extremely low false positives. Our proposed system performs face detection and face clustering in an online manner to drastically reduce the response time of search requests from users. To retrieve person occurrence count and duration quickly, we design a process so-called Logical Occurrences that utilizes the maximum interval of detected time of faces to efficiently compute the occurrence count. Such a process can reduce the online computational complexity from O(M2) to O(M) by pre-computing elapsed time during the online face clustering. The proposed system is evaluated on a real data set which contains about 1 million of detected faces for search. In the experiments, our system responds to search requests within 2 seconds on average, and achieves 99.9% precision of search results over more than 200 actual search requests.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116847436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Violent Scene Detection of Film Videos Based on Multi-Task Learning of Temporal-Spatial Features 基于时空特征多任务学习的电影视频暴力场景检测

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00067

Z. Zheng, Wei Zhong, Long Ye, Li Fang, Qin Zhang

{"title":"Violent Scene Detection of Film Videos Based on Multi-Task Learning of Temporal-Spatial Features","authors":"Z. Zheng, Wei Zhong, Long Ye, Li Fang, Qin Zhang","doi":"10.1109/MIPR51284.2021.00067","DOIUrl":"https://doi.org/10.1109/MIPR51284.2021.00067","url":null,"abstract":"In this paper, we propose a new framework for the violent scene detection of film videos based on multi-task learning of temporal-spatial features. In the proposed framework, for the violent behavior representation of film clips, we employ a temporal excitation and aggregation network to extract the temporal-spatial deep features in the visual modality. And on the other hand, a recurrent neural network with local attention is utilized to extract the utterance-level representation of affective analysis in the audio modality. In the process of feature mapping, we concern the task of violent scene detection together with that of affective analysis and then propose a multi-task learning strategy to effectively predict the violent scene of film clips. To evaluate the effectiveness of the proposed method, the experiments are done on the task of violent scenes detection 2015. The experimental results show that our model outperforms most of the state of the art methods, validating the innovation of considering the task of violent scene detection jointly with the violence emotion analysis.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116775365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Hardness Prediction for More Reliable Attribute-based Person Re-identification

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00077

Lucas Florin, Andreas Specker, Arne Schumann, J. Beyerer

{"title":"Hardness Prediction for More Reliable Attribute-based Person Re-identification","authors":"Lucas Florin, Andreas Specker, Arne Schumann, J. Beyerer","doi":"10.1109/MIPR51284.2021.00077","DOIUrl":"https://doi.org/10.1109/MIPR51284.2021.00077","url":null,"abstract":"Recognition of person attributes in surveillance camera imagery is often used as an auxiliary cue in person re-identification approaches. Additionally, increasingly more attention is being payed to the cross modal task of person re-identification based purely on attribute queries. In both of these settings, the reliability of attribute predictions is crucial for success. However, the task attribute recognition is affected by several non-trivial challenges. These include common aspects, such as degraded image quality through low resolution, motion blur, lighting conditions and similar factors. Another important factor in the context of attribute recognition is, however, the lack of visibility due to occlusion through scene objects, other persons or self-occlusion or simply due to mis-cropped person detections. All these factors make attribute prediction challenging and the resulting detections everything but reliable. In order to improve their applicability to person re-identification, we propose to apply hardness prediction models and provide an additional hardness score with each attribute that measures the likelihood of the actual prediction to be reliable. We investigate several key aspects of hardness prediction in the context of attribute recognition and compare our resulting hardness predictor to several alternatives. Finally, we include the hardness prediction into an attribute-based re-identification task and show improvements in the resulting accuracy. Our code is available at https://github.com/Lucas-Florin/hardness-predictor-for-par.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125256411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Cross-domain Person Re-Identification with Identity-preserving Style Transfer 保留身份风格迁移的跨域人物再认同

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00008

Shixing Chen, Caojin Zhang, Mingtao Dong, Chengcui Zhang

引用次数: 0

Design and Development of an Intelligent Pet-Type Quadruped Robot 智能宠物型四足机器人的设计与开发

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00068

Feng Gao, Chengjia Lei, Xingguo Long, Jin Wang, Peiheng Song

引用次数: 1

Effect of Walkability on Rental Prices in Tokyo 东京可步行性对租房价格的影响

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00054

A. Bramson, Megumi Hori

引用次数: 0

Smart Portable Musical Simulation System Based on Unified Temperament 基于统一气质的智能便携式音乐模拟系统

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00069

Lin Gan, Li Lv, Cuicui Wang, Mu Zhang

引用次数: 0

A probabilistic and random method for the generation of Bai nationality music fragments 白族音乐片段的概率随机生成方法

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI: 10.1109/MIPR51284.2021.00057

Pengcheng Shang, Shan Ni, Li Zhou

引用次数: 1