Investigating the Illicit Trade of Cultural Property with an Automated Data Pipeline Architecture
Nicholas Landi, Elizabeth Lee, Karolina Naranjo-Velasco, Felipe Barraza
2022 Systems and Information Engineering Design Symposium (SIEDS), published 2022-04-28
DOI: https://doi.org/10.1109/sieds55548.2022.9799367
Abstract
The scale of global art crime has been difficult to quantify because of the vast number of transactions and the varied methods of trade. Although online marketplaces such as eBay offer promising data for studying and tracking this illicit market, their role in it has not been systematically studied, largely because compiling and wrangling these data is highly technical. This research project partners with the Cultural Resilience Informatics and Analysis (CURIA) Lab to design a robust data pipeline that collects, processes, and stores data from eBay in order to quantify and analyze the network mobility of illicit cultural property. The pipeline consists of a template for accessing eBay's API, understanding the API documentation, and collecting the features needed for network analysis. To our knowledge, this is the first data pipeline architecture that collects data from listings across categories of interest and stores features in a SQLite database through an automated, recursive script for social science research. The metadata for building and maintaining the pipeline is recorded in an in-depth guide. The resulting framework is a replicable blueprint for interacting with an online marketplace's API environment. This project serves as a precursor to research on the global trade of illicit cultural property through subsequent network and spatial analysis.
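The collect-and-store step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the table layout, the feature names (`item_id`, `title`, `category`, `seller`, `location`), the page shape, and the `fetch_page` callable are all assumptions made for illustration, and no real eBay API call is shown. A stand-in `demo_fetch_page` function plays the role of the API so the sketch runs on its own.

```python
import sqlite3
from typing import Callable, Optional

# Assumed page shape (hypothetical, not eBay's actual response format):
# {"items": [ {feature dict}, ... ], "next": <token for next page or None>}
FetchPage = Callable[[str, Optional[str]], dict]


def init_db(conn: sqlite3.Connection) -> None:
    """Create the listings table; the columns are illustrative features."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS listings (
               item_id  TEXT PRIMARY KEY,
               title    TEXT,
               category TEXT,
               seller   TEXT,
               location TEXT
           )"""
    )


def store_items(conn: sqlite3.Connection, items: list) -> None:
    """Insert listings, silently skipping item_ids that are already stored."""
    conn.executemany(
        "INSERT OR IGNORE INTO listings "
        "VALUES (:item_id, :title, :category, :seller, :location)",
        items,
    )
    conn.commit()


def collect(conn: sqlite3.Connection, fetch_page: FetchPage,
            category: str, token: Optional[str] = None) -> int:
    """Recursively walk result pages for one category.

    Returns the number of rows newly written to the database.
    """
    page = fetch_page(category, token)
    before = conn.total_changes
    store_items(conn, page["items"])
    stored = conn.total_changes - before
    if page.get("next") is None:  # base case: no further pages
        return stored
    return stored + collect(conn, fetch_page, category, page["next"])


def demo_fetch_page(category: str, token: Optional[str]) -> dict:
    """Stand-in for an API client: two fixed pages with one repeated listing."""
    def row(i: str) -> dict:
        return {"item_id": i, "title": f"listing {i}", "category": category,
                "seller": f"seller-{i}", "location": "unknown"}
    pages = {
        None: {"items": [row("a1"), row("a2")], "next": "page2"},
        "page2": {"items": [row("a2"), row("a3")], "next": None},
    }
    return pages[token]
```

Calling `collect(conn, demo_fetch_page, "antiquities")` on a fresh in-memory database after `init_db(conn)` stores each distinct listing once, and a second call adds nothing because `INSERT OR IGNORE` skips known `item_id` values. The recursion mirrors the "recursive script" wording of the abstract; for categories with very many result pages, a loop would avoid Python's recursion limit.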