Shiming Shen, Matteo Treleani, Dario Compagno, Marco Winckler
{"title":"从库存照片到幽灵数据:追踪关于欧盟的视听档案","authors":"Shiming Shen, Matteo Treleani, Dario Compagno, Marco Winckler","doi":"10.18146/view.292","DOIUrl":null,"url":null,"abstract":"This paper deals with a major challenge linked to the collection of audiovisual documents within television and web archives. Looking for repeated sequences within a corpus of thousands of videos, we faced the fact that the footage we were looking for reveals itself to be reachable only as ghost data. In fact, any audiovisual sequence reused within different contexts exists conceptually as the repetition of one single visual unit, but from the point of view of the metadata tagging its occurrences, each item is a distinct document. Like a ghost, the shot is there, scattered among different places, but the metadata cannot point us to the visual form repeated, despite its evidence to the human viewer. When facing large amounts of data, to relate a visual unit to its occurrences, data analysis techniques are needed. We describe our procedures of collection and annotation, and the solutions combining qualitive work and a computer-aided approach to face this main challenge, within the research project Crossing Borders Archives (CROBORA).","PeriodicalId":115199,"journal":{"name":"VIEW Journal of European Television History and Culture","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Stock Shots to Ghost Data: Tracking Audiovisual Archives about the European Union\",\"authors\":\"Shiming Shen, Matteo Treleani, Dario Compagno, Marco Winckler\",\"doi\":\"10.18146/view.292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with a major challenge linked to the collection of audiovisual documents within television and web archives. Looking for repeated sequences within a corpus of thousands of videos, we faced the fact that the footage we were looking for reveals itself to be reachable only as ghost data. In fact, any audiovisual sequence reused within different contexts exists conceptually as the repetition of one single visual unit, but from the point of view of the metadata tagging its occurrences, each item is a distinct document. Like a ghost, the shot is there, scattered among different places, but the metadata cannot point us to the visual form repeated, despite its evidence to the human viewer. When facing large amounts of data, to relate a visual unit to its occurrences, data analysis techniques are needed. We describe our procedures of collection and annotation, and the solutions combining qualitive work and a computer-aided approach to face this main challenge, within the research project Crossing Borders Archives (CROBORA).\",\"PeriodicalId\":115199,\"journal\":{\"name\":\"VIEW Journal of European Television History and Culture\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"VIEW Journal of European Television History and Culture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18146/view.292\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"VIEW Journal of European Television History and Culture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18146/view.292","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
From Stock Shots to Ghost Data: Tracking Audiovisual Archives about the European Union
This paper deals with a major challenge linked to the collection of audiovisual documents within television and web archives. Looking for repeated sequences within a corpus of thousands of videos, we faced the fact that the footage we were looking for reveals itself to be reachable only as ghost data. In fact, any audiovisual sequence reused within different contexts exists conceptually as the repetition of one single visual unit, but from the point of view of the metadata tagging its occurrences, each item is a distinct document. Like a ghost, the shot is there, scattered among different places, but the metadata cannot point us to the visual form repeated, despite its evidence to the human viewer. When facing large amounts of data, to relate a visual unit to its occurrences, data analysis techniques are needed. We describe our procedures of collection and annotation, and the solutions combining qualitive work and a computer-aided approach to face this main challenge, within the research project Crossing Borders Archives (CROBORA).