Early detection of fraudulent COVID-19 products from Twitter chatter

IF 3.5 Q1 HEALTH CARE SCIENCES & SERVICES

JMIR infodemiology Pub Date : 2022-05-11 DOI:10.1101/2022.05.09.22274776

A. Sarker, S. Lakamana, R. Liao, A. Abbas, Y.-C. Yang, M. Al-garadi

{"title":"Early detection of fraudulent COVID-19 products from Twitter chatter","authors":"A. Sarker, S. Lakamana, R. Liao, A. Abbas, Y.-C. Yang, M. Al-garadi","doi":"10.1101/2022.05.09.22274776","DOIUrl":null,"url":null,"abstract":"Social media have served as lucrative platforms for misinformation and for promoting fraudulent products for the treatment, testing and prevention of COVID-19. This has resulted in the issuance of many warning letters by the United States Food and Drug Administration (FDA). While social media continue to serve as the primary platform for the promotion of such fraudulent products, they also present the opportunity to identify these products early by employing effective social media mining methods. In this study, we employ natural language processing and time series anomaly detection methods for automatically detecting fraudulent COVID-19 products early from Twitter. Our approach is based on the intuition that increases in the popularity of fraudulent products lead to corresponding anomalous increases in the volume of chatter regarding them. We utilized an anomaly detection method on streaming COVID-19-related Twitter data to detect potentially anomalous increases in mentions of fraudulent products. Our unsupervised approach detected 34/44 (77.3%) signals about fraudulent products earlier than the FDA letter issuance dates, and an additional 6/44 (13.6%) within a week following the corresponding FDA letters. Our proposed method is simple, effective and easy to deploy, and do not require high performance computing machinery unlike deep neural network-based methods.","PeriodicalId":73554,"journal":{"name":"JMIR infodemiology","volume":" ","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR infodemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2022.05.09.22274776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Social media have served as lucrative platforms for misinformation and for promoting fraudulent products for the treatment, testing and prevention of COVID-19. This has resulted in the issuance of many warning letters by the United States Food and Drug Administration (FDA). While social media continue to serve as the primary platform for the promotion of such fraudulent products, they also present the opportunity to identify these products early by employing effective social media mining methods. In this study, we employ natural language processing and time series anomaly detection methods for automatically detecting fraudulent COVID-19 products early from Twitter. Our approach is based on the intuition that increases in the popularity of fraudulent products lead to corresponding anomalous increases in the volume of chatter regarding them. We utilized an anomaly detection method on streaming COVID-19-related Twitter data to detect potentially anomalous increases in mentions of fraudulent products. Our unsupervised approach detected 34/44 (77.3%) signals about fraudulent products earlier than the FDA letter issuance dates, and an additional 6/44 (13.6%) within a week following the corresponding FDA letters. Our proposed method is simple, effective and easy to deploy, and do not require high performance computing machinery unlike deep neural network-based methods.

查看原文本刊更多论文

从推特聊天中及早发现新冠肺炎欺诈产品

社交媒体一直是虚假信息和推广新冠肺炎治疗、检测和预防欺诈产品的有利可图的平台。这导致美国食品药品监督管理局（FDA）发出了许多警告信。虽然社交媒体仍然是推广此类欺诈产品的主要平台，但它们也提供了通过采用有效的社交媒体挖掘方法尽早识别这些产品的机会。在这项研究中，我们采用自然语言处理和时间序列异常检测方法，从推特早期自动检测欺诈性新冠肺炎产品。我们的方法基于这样一种直觉，即欺诈产品受欢迎程度的增加会导致相关聊天量的异常增加。我们对与新冠肺炎相关的推特数据流使用了异常检测方法，以检测欺诈产品提及量的潜在异常增加。我们的无监督方法在美国食品药品监督管理局信函发布日期之前检测到34/44（77.3%）关于欺诈产品的信号，在相应的美国食品药品管理局信函发出后的一周内又检测到6/44（13.6%）。与基于深度神经网络的方法不同，我们提出的方法简单、有效且易于部署，并且不需要高性能的计算机器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JMIR infodemiology

CiteScore

4.80

自引率

0.00%

发文量