大规模API废弃案例研究:国会暴乱后Parler数据泄露

2022 7th International Conference on Smart and Sustainable Technologies (SpliTech) Pub Date : 2022-07-05 DOI:10.23919/SpliTech55088.2022.9854293

David Redding, J. Ang, S. Bhunia

{"title":"大规模API废弃案例研究:国会暴乱后Parler数据泄露","authors":"David Redding, J. Ang, S. Bhunia","doi":"10.23919/SpliTech55088.2022.9854293","DOIUrl":null,"url":null,"abstract":"After the United States Capitol Hill Riots, there was a massive API scraping of Parler, an open social media platform, which resulted in 70 terabytes of user data being collected. The data breach, a serious confidential personal data leak, was not performed illegally. This paper analyzes the data breach and its impact in depth. The breach was a result of a hacktivist going with the alias @donk_enby, performing a massive API scraping of Parler's servers. The scraping took metadata from user's public, private, and previously deleted posts, uploaded to Parler's servers. Parler had failed to clear the metadata of these posts. The metadata contained names, dates, locations, and other data about the users who posted content to Parler's site. Over 70,000 GPS locations of Parler's users have been uncovered including users' private properties. These locations have also been used to tie citizens to the Capitol Riots if they uploaded any content about the riot from that day. Forms containing government identification of users were also leaked from Parler's servers that were used for account verification. This paper demonstrate background on the events leading up to, including, and following the Capitol Riots. The paper also examine the hacktivist's methodology for performing the API scraping and discuss possible defensive strategies such as API rate limiting, API request sanitation, and API call authorization.","PeriodicalId":295373,"journal":{"name":"2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Case Study of Massive API Scrapping: Parler Data Breach After the Capitol Riot\",\"authors\":\"David Redding, J. Ang, S. Bhunia\",\"doi\":\"10.23919/SpliTech55088.2022.9854293\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"After the United States Capitol Hill Riots, there was a massive API scraping of Parler, an open social media platform, which resulted in 70 terabytes of user data being collected. The data breach, a serious confidential personal data leak, was not performed illegally. This paper analyzes the data breach and its impact in depth. The breach was a result of a hacktivist going with the alias @donk_enby, performing a massive API scraping of Parler's servers. The scraping took metadata from user's public, private, and previously deleted posts, uploaded to Parler's servers. Parler had failed to clear the metadata of these posts. The metadata contained names, dates, locations, and other data about the users who posted content to Parler's site. Over 70,000 GPS locations of Parler's users have been uncovered including users' private properties. These locations have also been used to tie citizens to the Capitol Riots if they uploaded any content about the riot from that day. Forms containing government identification of users were also leaked from Parler's servers that were used for account verification. This paper demonstrate background on the events leading up to, including, and following the Capitol Riots. The paper also examine the hacktivist's methodology for performing the API scraping and discuss possible defensive strategies such as API rate limiting, API request sanitation, and API call authorization.\",\"PeriodicalId\":295373,\"journal\":{\"name\":\"2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/SpliTech55088.2022.9854293\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SpliTech55088.2022.9854293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在美国国会山骚乱之后，对开放社交媒体平台Parler进行了大规模的API抓取，导致70tb的用户数据被收集。此次数据泄露是一起严重的个人机密数据泄露事件，并非非法行为。本文对数据泄露及其影响进行了深入分析。这次入侵是一个化名为@donk_enby的黑客主义者对Parler的服务器进行大规模API抓取的结果。从用户的公开、私人和之前删除的帖子中抓取元数据，上传到Parler的服务器上。Parler未能清除这些帖子的元数据。元数据包含在Parler网站上发布内容的用户的姓名、日期、地点和其他数据。超过70,000个Parler用户的GPS位置被发现，包括用户的私人财产。这些地点也被用来将公民与国会大厦骚乱联系起来，如果他们上传了任何关于当天骚乱的内容。包含政府用户身份的表格也从Parler的服务器泄露，用于账户验证。本文展示了导致国会大厦骚乱的背景，包括和之后的事件。本文还研究了黑客主义者执行API抓取的方法，并讨论了可能的防御策略，如API速率限制、API请求卫生和API调用授权。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Case Study of Massive API Scrapping: Parler Data Breach After the Capitol Riot

After the United States Capitol Hill Riots, there was a massive API scraping of Parler, an open social media platform, which resulted in 70 terabytes of user data being collected. The data breach, a serious confidential personal data leak, was not performed illegally. This paper analyzes the data breach and its impact in depth. The breach was a result of a hacktivist going with the alias @donk_enby, performing a massive API scraping of Parler's servers. The scraping took metadata from user's public, private, and previously deleted posts, uploaded to Parler's servers. Parler had failed to clear the metadata of these posts. The metadata contained names, dates, locations, and other data about the users who posted content to Parler's site. Over 70,000 GPS locations of Parler's users have been uncovered including users' private properties. These locations have also been used to tie citizens to the Capitol Riots if they uploaded any content about the riot from that day. Forms containing government identification of users were also leaked from Parler's servers that were used for account verification. This paper demonstrate background on the events leading up to, including, and following the Capitol Riots. The paper also examine the hacktivist's methodology for performing the API scraping and discuss possible defensive strategies such as API rate limiting, API request sanitation, and API call authorization.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)

自引率

0.00%

发文量