Madison Leeson, Riccardo Giovanelli, Sara Ferro, Michela De Bernardin, Arianna Traviglia
{"title":"克服文化遗产犯罪研究中的数据孤岛:关于艺术品、古物和文化商品贸易的综合osint衍生数据集","authors":"Madison Leeson, Riccardo Giovanelli, Sara Ferro, Michela De Bernardin, Arianna Traviglia","doi":"10.1007/s10502-025-09485-x","DOIUrl":null,"url":null,"abstract":"<div><p>The current landscape for provenance researchers, cultural heritage crime analysts, and law enforcement working in the culture sector is characterised by the siloing of data across dozens of databases, resulting in fragmented and incomplete resources that must be manually correlated and validated to provide insights into cultural heritage crime. The European Union-funded Research, Intelligence, and Technology for Heritage and Market Security (RITHMS) project is developing a platform to assist law enforcement agencies across Europe in tackling the illicit trafficking of cultural goods by aggregating open, specialised, and police data from a range of sources. This article outlines one of the initial phases of data collection, which has developed 30 tailored web scrapers for the collection of data from existing databases of stolen, missing, protected, and unprovenanced cultural goods. This has resulted in the largest known non-police dataset of these and associated data objects addressing the real-world challenge of data siloes in heritage crime and provenance research. This article details the multi-step process of data collection and pre-processing that has produced the novel consolidated dataset. The mechanism for knowledge discovery developed during this project has immediate applications and has resulted in actionable intelligence for the investigation of cultural goods crimes, highlighting the value of consolidated data as a resource. This research also offers a cursory analysis of the resulting dataset, demonstrating how mining of this resource can enable new scientific insights and offer promising opportunities for intelligence-led policing of cultural heritage crimes.</p></div>","PeriodicalId":46131,"journal":{"name":"ARCHIVAL SCIENCE","volume":"25 2","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10502-025-09485-x.pdf","citationCount":"0","resultStr":"{\"title\":\"Overcoming data siloes in cultural heritage crime research: a consolidated OSINT-derived dataset on art, antiquities, and the trade in cultural goods\",\"authors\":\"Madison Leeson, Riccardo Giovanelli, Sara Ferro, Michela De Bernardin, Arianna Traviglia\",\"doi\":\"10.1007/s10502-025-09485-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The current landscape for provenance researchers, cultural heritage crime analysts, and law enforcement working in the culture sector is characterised by the siloing of data across dozens of databases, resulting in fragmented and incomplete resources that must be manually correlated and validated to provide insights into cultural heritage crime. The European Union-funded Research, Intelligence, and Technology for Heritage and Market Security (RITHMS) project is developing a platform to assist law enforcement agencies across Europe in tackling the illicit trafficking of cultural goods by aggregating open, specialised, and police data from a range of sources. This article outlines one of the initial phases of data collection, which has developed 30 tailored web scrapers for the collection of data from existing databases of stolen, missing, protected, and unprovenanced cultural goods. This has resulted in the largest known non-police dataset of these and associated data objects addressing the real-world challenge of data siloes in heritage crime and provenance research. This article details the multi-step process of data collection and pre-processing that has produced the novel consolidated dataset. The mechanism for knowledge discovery developed during this project has immediate applications and has resulted in actionable intelligence for the investigation of cultural goods crimes, highlighting the value of consolidated data as a resource. This research also offers a cursory analysis of the resulting dataset, demonstrating how mining of this resource can enable new scientific insights and offer promising opportunities for intelligence-led policing of cultural heritage crimes.</p></div>\",\"PeriodicalId\":46131,\"journal\":{\"name\":\"ARCHIVAL SCIENCE\",\"volume\":\"25 2\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10502-025-09485-x.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ARCHIVAL SCIENCE\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10502-025-09485-x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ARCHIVAL SCIENCE","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s10502-025-09485-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
Overcoming data siloes in cultural heritage crime research: a consolidated OSINT-derived dataset on art, antiquities, and the trade in cultural goods
The current landscape for provenance researchers, cultural heritage crime analysts, and law enforcement working in the culture sector is characterised by the siloing of data across dozens of databases, resulting in fragmented and incomplete resources that must be manually correlated and validated to provide insights into cultural heritage crime. The European Union-funded Research, Intelligence, and Technology for Heritage and Market Security (RITHMS) project is developing a platform to assist law enforcement agencies across Europe in tackling the illicit trafficking of cultural goods by aggregating open, specialised, and police data from a range of sources. This article outlines one of the initial phases of data collection, which has developed 30 tailored web scrapers for the collection of data from existing databases of stolen, missing, protected, and unprovenanced cultural goods. This has resulted in the largest known non-police dataset of these and associated data objects addressing the real-world challenge of data siloes in heritage crime and provenance research. This article details the multi-step process of data collection and pre-processing that has produced the novel consolidated dataset. The mechanism for knowledge discovery developed during this project has immediate applications and has resulted in actionable intelligence for the investigation of cultural goods crimes, highlighting the value of consolidated data as a resource. This research also offers a cursory analysis of the resulting dataset, demonstrating how mining of this resource can enable new scientific insights and offer promising opportunities for intelligence-led policing of cultural heritage crimes.
期刊介绍:
Archival Science promotes the development of archival science as an autonomous scientific discipline. The journal covers all aspects of archival science theory, methodology, and practice. Moreover, it investigates different cultural approaches to creation, management and provision of access to archives, records, and data. It also seeks to promote the exchange and comparison of concepts, views and attitudes related to recordkeeping issues around the world.Archival Science''s approach is integrated, interdisciplinary, and intercultural. Its scope encompasses the entire field of recorded process-related information, analyzed in terms of form, structure, and context. To meet its objectives, the journal draws from scientific disciplines that deal with the function of records and the way they are created, preserved, and retrieved; the context in which information is generated, managed, and used; and the social and cultural environment of records creation at different times and places.Covers all aspects of archival science theory, methodology, and practiceInvestigates different cultural approaches to creation, management and provision of access to archives, records, and dataPromotes the exchange and comparison of concepts, views, and attitudes related to recordkeeping issues around the worldAddresses the entire field of recorded process-related information, analyzed in terms of form, structure, and context