From Stock Shots to Ghost Data: Tracking Audiovisual Archives about the European Union

VIEW Journal of European Television History and Culture. Pub Date: 2023-09-05. DOI: 10.18146/view.292

Shiming Shen, Matteo Treleani, Dario Compagno, Marco Winckler

Citations: 0

Abstract

This paper addresses a major challenge in collecting audiovisual documents from television and web archives. While looking for repeated sequences within a corpus of thousands of videos, we found that the footage we were looking for was reachable only as ghost data. Any audiovisual sequence reused in different contexts exists conceptually as the repetition of a single visual unit, but from the point of view of the metadata tagging its occurrences, each item is a distinct document. Like a ghost, the shot is there, scattered across different places, but the metadata cannot point us to the repeated visual form, however evident it is to the human viewer. With large amounts of data, relating a visual unit to its occurrences requires data analysis techniques. We describe our collection and annotation procedures, and the solutions, combining qualitative work with a computer-aided approach, that we used to meet this challenge within the research project Crossing Borders Archives (CROBORA).
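The abstract does not say which data analysis techniques the project relies on. Purely as an illustration of the "ghost data" problem, the sketch below assumes each archive document is reduced to one tiny grayscale key frame and links documents whose frames have nearly identical average-hash fingerprints. The document ids, pixel values, and function names are hypothetical, and this simple hash stands in for whatever method the authors actually use.

```python
def average_hash(frame):
    """Fingerprint a grayscale frame: '1' where a pixel is brighter than the frame mean."""
    pixels = [p for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    return "".join("1" if p > mean else "0" for p in pixels)


def hamming(a, b):
    """Count the positions at which two equal-length fingerprints differ."""
    return sum(x != y for x, y in zip(a, b))


def group_occurrences(documents, max_distance=1):
    """Group document ids whose fingerprint is within max_distance bits
    of the fingerprint of the first document placed in a group."""
    groups = []  # list of (reference_fingerprint, [document ids])
    for doc_id, frame in documents.items():
        fingerprint = average_hash(frame)
        for reference, ids in groups:
            if hamming(reference, fingerprint) <= max_distance:
                ids.append(doc_id)
                break
        else:
            # No existing group is close enough: start a new visual unit.
            groups.append((fingerprint, [doc_id]))
    return [ids for _, ids in groups]


# Hypothetical toy corpus: the metadata treats these as three distinct documents,
# but the first two contain the same shot, slightly altered by re-encoding.
documents = {
    "tv_news_2004": [[10, 200], [220, 30]],
    "web_clip_2011": [[12, 190], [210, 35]],
    "tv_debate_2015": [[200, 10], [20, 230]],
}

print(group_occurrences(documents))
# [['tv_news_2004', 'web_clip_2011'], ['tv_debate_2015']]
```

The grouping is derived from the images alone, which is the point of the illustration: nothing in the per-document metadata would connect the first two items.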

Source journal

VIEW Journal of European Television History and Culture

Self-citation rate

0.00%

Number of publications