Nicholas Landi, Elizabeth Lee, Karolina Naranjo-Velasco, Felipe Barraza
{"title":"用自动化数据管道架构调查文化财产非法贸易","authors":"Nicholas Landi, Elizabeth Lee, Karolina Naranjo-Velasco, Felipe Barraza","doi":"10.1109/sieds55548.2022.9799367","DOIUrl":null,"url":null,"abstract":"The scale of global art crime has been difficult to quantify due to the vast number of transactions and varying methods of trade. Although online marketplace platforms such as eBay offer promising data to study and track this illicit market, this relationship has not been systematically studied due to the highly technical nature of compiling and wrangling these data. This research project partners with the Cultural Resilience Informatics and Analysis (CURIA) Lab to design a robust data pipeline that collects, processes, and stores data from eBay to quantify and analyze the network mobility of illicit cultural property. The data pipeline consists of a template for accessing eBay's API, understanding API documentation, and collecting necessary features for network analysis. This process represents the first data pipeline architecture to our knowledge that collects data from listings across categories of interest, and stores features in a SQLite database through an automated, recursive script for social science research. The metadata for building and maintaining the data pipeline is recorded in an in-depth guide. The result of this data pipeline framework is a replicable blueprint for interacting with an online marketplace's API environment. This project will act as a precursor to begin research regarding the global trade of illicit cultural property through subsequent network and spatial analysis.","PeriodicalId":286724,"journal":{"name":"2022 Systems and Information Engineering Design Symposium (SIEDS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Investigating the Illicit Trade of Cultural Property with an Automated Data Pipeline Architecture\",\"authors\":\"Nicholas Landi, Elizabeth Lee, Karolina Naranjo-Velasco, Felipe Barraza\",\"doi\":\"10.1109/sieds55548.2022.9799367\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The scale of global art crime has been difficult to quantify due to the vast number of transactions and varying methods of trade. Although online marketplace platforms such as eBay offer promising data to study and track this illicit market, this relationship has not been systematically studied due to the highly technical nature of compiling and wrangling these data. This research project partners with the Cultural Resilience Informatics and Analysis (CURIA) Lab to design a robust data pipeline that collects, processes, and stores data from eBay to quantify and analyze the network mobility of illicit cultural property. The data pipeline consists of a template for accessing eBay's API, understanding API documentation, and collecting necessary features for network analysis. This process represents the first data pipeline architecture to our knowledge that collects data from listings across categories of interest, and stores features in a SQLite database through an automated, recursive script for social science research. The metadata for building and maintaining the data pipeline is recorded in an in-depth guide. The result of this data pipeline framework is a replicable blueprint for interacting with an online marketplace's API environment. This project will act as a precursor to begin research regarding the global trade of illicit cultural property through subsequent network and spatial analysis.\",\"PeriodicalId\":286724,\"journal\":{\"name\":\"2022 Systems and Information Engineering Design Symposium (SIEDS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Systems and Information Engineering Design Symposium (SIEDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/sieds55548.2022.9799367\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/sieds55548.2022.9799367","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Investigating the Illicit Trade of Cultural Property with an Automated Data Pipeline Architecture
The scale of global art crime has been difficult to quantify due to the vast number of transactions and varying methods of trade. Although online marketplace platforms such as eBay offer promising data to study and track this illicit market, this relationship has not been systematically studied due to the highly technical nature of compiling and wrangling these data. This research project partners with the Cultural Resilience Informatics and Analysis (CURIA) Lab to design a robust data pipeline that collects, processes, and stores data from eBay to quantify and analyze the network mobility of illicit cultural property. The data pipeline consists of a template for accessing eBay's API, understanding API documentation, and collecting necessary features for network analysis. This process represents the first data pipeline architecture to our knowledge that collects data from listings across categories of interest, and stores features in a SQLite database through an automated, recursive script for social science research. The metadata for building and maintaining the data pipeline is recorded in an in-depth guide. The result of this data pipeline framework is a replicable blueprint for interacting with an online marketplace's API environment. This project will act as a precursor to begin research regarding the global trade of illicit cultural property through subsequent network and spatial analysis.