图形地图中人物的实例分割、身体部位解析与姿态估计

IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS

International Journal of Cartography Pub Date : 2021-08-10 DOI:10.1080/23729333.2021.1949087

R. Schnürer, A. Cengiz Öztireli, M. Heitzler, R. Sieber, L. Hurni

{"title":"图形地图中人物的实例分割、身体部位解析与姿态估计","authors":"R. Schnürer, A. Cengiz Öztireli, M. Heitzler, R. Sieber, L. Hurni","doi":"10.1080/23729333.2021.1949087","DOIUrl":null,"url":null,"abstract":"ABSTRACT In recent years, convolutional neural networks (CNNs) have been applied successfully to recognise persons, their body parts and pose keypoints in photos and videos. The transfer of these techniques to artificially created images is rather unexplored, though challenging since these images are drawn in different styles, body proportions, and levels of abstraction. In this work, we study these problems on the basis of pictorial maps where we identify included human figures with two consecutive CNNs: We first segment individual figures with Mask R-CNN, and then parse their body parts and estimate their poses simultaneously with four different UNet++ versions. We train the CNNs with a mixture of real persons and synthetic figures and compare the results with manually annotated test datasets consisting of pictorial figures. By varying the training datasets and the CNN configurations, we were able to improve the original Mask R-CNN model and we achieved moderately satisfying results with the UNet++ versions. The extracted figures may be used for animation and storytelling and may be relevant for the analysis of historic and contemporary maps.","PeriodicalId":36401,"journal":{"name":"International Journal of Cartography","volume":"111 3S 1","pages":"291 - 307"},"PeriodicalIF":0.9000,"publicationDate":"2021-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Instance Segmentation, Body Part Parsing, and Pose Estimation of Human Figures in Pictorial Maps\",\"authors\":\"R. Schnürer, A. Cengiz Öztireli, M. Heitzler, R. Sieber, L. Hurni\",\"doi\":\"10.1080/23729333.2021.1949087\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT In recent years, convolutional neural networks (CNNs) have been applied successfully to recognise persons, their body parts and pose keypoints in photos and videos. The transfer of these techniques to artificially created images is rather unexplored, though challenging since these images are drawn in different styles, body proportions, and levels of abstraction. In this work, we study these problems on the basis of pictorial maps where we identify included human figures with two consecutive CNNs: We first segment individual figures with Mask R-CNN, and then parse their body parts and estimate their poses simultaneously with four different UNet++ versions. We train the CNNs with a mixture of real persons and synthetic figures and compare the results with manually annotated test datasets consisting of pictorial figures. By varying the training datasets and the CNN configurations, we were able to improve the original Mask R-CNN model and we achieved moderately satisfying results with the UNet++ versions. The extracted figures may be used for animation and storytelling and may be relevant for the analysis of historic and contemporary maps.\",\"PeriodicalId\":36401,\"journal\":{\"name\":\"International Journal of Cartography\",\"volume\":\"111 3S 1\",\"pages\":\"291 - 307\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2021-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Cartography\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/23729333.2021.1949087\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cartography","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/23729333.2021.1949087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 2

摘要

近年来，卷积神经网络(cnn)已经成功地应用于识别照片和视频中的人物、身体部位和姿势关键点。将这些技术转移到人工创建的图像是相当未开发的，尽管具有挑战性，因为这些图像以不同的风格，身体比例和抽象水平绘制。在这项工作中，我们在图像地图的基础上研究了这些问题，我们用两个连续的cnn来识别被包含的人物:我们首先用Mask R-CNN分割个人人物，然后用四个不同的unet++版本同时解析他们的身体部位并估计他们的姿势。我们用真人和合成图混合训练cnn，并将结果与人工标注的由图形组成的测试数据集进行比较。通过改变训练数据集和CNN配置，我们能够改进原始的Mask R-CNN模型，并在unnet++版本中获得了中等满意的结果。提取的数据可以用于动画和讲故事，也可以用于历史和当代地图的分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Instance Segmentation, Body Part Parsing, and Pose Estimation of Human Figures in Pictorial Maps

ABSTRACT In recent years, convolutional neural networks (CNNs) have been applied successfully to recognise persons, their body parts and pose keypoints in photos and videos. The transfer of these techniques to artificially created images is rather unexplored, though challenging since these images are drawn in different styles, body proportions, and levels of abstraction. In this work, we study these problems on the basis of pictorial maps where we identify included human figures with two consecutive CNNs: We first segment individual figures with Mask R-CNN, and then parse their body parts and estimate their poses simultaneously with four different UNet++ versions. We train the CNNs with a mixture of real persons and synthetic figures and compare the results with manually annotated test datasets consisting of pictorial figures. By varying the training datasets and the CNN configurations, we were able to improve the original Mask R-CNN model and we achieved moderately satisfying results with the UNet++ versions. The extracted figures may be used for animation and storytelling and may be relevant for the analysis of historic and contemporary maps.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Cartography Social Sciences-Geography, Planning and Development

CiteScore

1.40

自引率

0.00%

发文量