Improving object detection in paintings based on time contexts

2020 International Conference on Data Mining Workshops (ICDMW) Pub Date : 2020-11-01 DOI:10.1109/ICDMW51313.2020.00133

M. Marinescu, Artem Reshetnikov, J. M. López

{"title":"Improving object detection in paintings based on time contexts","authors":"M. Marinescu, Artem Reshetnikov, J. M. López","doi":"10.1109/ICDMW51313.2020.00133","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel approach to object detection for the Cultural Heritage domain, which relies on combining Deep Learning and semantic metadata about candidate objects extracted from existing sources such as Wikidata, dictionaries, or Google NGram. Working with cultural heritage presents challenges not present in every-day images. In computer vision, object detection models are usually trained with datasets whose classes are not imaginary concepts, and have neither symbolic nor time-specific dimensions. Apart from this conceptual problem, the paintings are limited in number and represent the same concept in potentially very different styles. Finally, the metadata associated with the images is often poor or inexistent, which makes it hard to properly train a model. Our approach can improve the precision of object detection by placing the classes detected by a neural network model in time, based on the dates of their first known use. By taking into account the time of inception of objects such as the TV, cell phone, or scissors, and the appearance of some objects in the geographical space that corresponds to a painting (e.g. bananas or broccoli in 15th century Europe), we can correct and refine the detected objects based on their chronologic probability.","PeriodicalId":426846,"journal":{"name":"2020 International Conference on Data Mining Workshops (ICDMW)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW51313.2020.00133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

This paper proposes a novel approach to object detection for the Cultural Heritage domain, which relies on combining Deep Learning and semantic metadata about candidate objects extracted from existing sources such as Wikidata, dictionaries, or Google NGram. Working with cultural heritage presents challenges not present in every-day images. In computer vision, object detection models are usually trained with datasets whose classes are not imaginary concepts, and have neither symbolic nor time-specific dimensions. Apart from this conceptual problem, the paintings are limited in number and represent the same concept in potentially very different styles. Finally, the metadata associated with the images is often poor or inexistent, which makes it hard to properly train a model. Our approach can improve the precision of object detection by placing the classes detected by a neural network model in time, based on the dates of their first known use. By taking into account the time of inception of objects such as the TV, cell phone, or scissors, and the appearance of some objects in the geographical space that corresponds to a painting (e.g. bananas or broccoli in 15th century Europe), we can correct and refine the detected objects based on their chronologic probability.

查看原文本刊更多论文

改进基于时间背景的绘画对象检测

本文提出了一种新的文化遗产领域对象检测方法，该方法依赖于结合深度学习和从现有来源(如Wikidata、字典或Google NGram)中提取的候选对象的语义元数据。从事文化遗产的工作所面临的挑战并不存在于日常图像中。在计算机视觉中，目标检测模型通常使用数据集进行训练，这些数据集的类别不是虚构的概念，既没有符号维度，也没有特定时间的维度。除了这个概念上的问题之外，这些画作的数量有限，并且以可能非常不同的风格代表了相同的概念。最后，与图像相关的元数据通常很差或不存在，这使得很难正确训练模型。我们的方法可以通过将神经网络模型检测到的类根据其首次使用的日期及时放置，从而提高目标检测的精度。通过考虑诸如电视、手机或剪刀等物体出现的时间，以及与一幅画对应的地理空间中某些物体的出现(例如15世纪欧洲的香蕉或西兰花)，我们可以根据它们的时间概率来纠正和完善检测到的物体。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 International Conference on Data Mining Workshops (ICDMW)

自引率

0.00%

发文量