Addressing Survey Fraud in Online Health Research: A Case Study of Latine Sexual Minority Men.

IF 2.4 4区医学 Q2 NURSING

Research in Nursing & Health Pub Date : 2025-09-26 DOI:10.1002/nur.70021

Lisvel A Matos, Susan Silva, Michael V Relf, Rosa Gonzalez-Guarda

{"title":"Addressing Survey Fraud in Online Health Research: A Case Study of Latine Sexual Minority Men.","authors":"Lisvel A Matos, Susan Silva, Michael V Relf, Rosa Gonzalez-Guarda","doi":"10.1002/nur.70021","DOIUrl":null,"url":null,"abstract":"<p><p>Online survey research has become an increasingly popular and effective method in the social sciences for exploring and addressing health-related issues. However, the increasing prevalence of fraudulent activities, particularly survey bots, threatens data integrity and can compromise health research by generating misleading data. The purpose of this paper was to describe the implementation of bot detection strategies in an online survey with Latine sexual minority men (SMM). Eleven bot detection indicators, including AI-detection software for open-ended responses, were used in two approaches to differentiate bot-generated from human responses. In the first approach, bot detection indicators were applied stepwise to identify valid entries. In the second approach, a fraud detection algorithm was used to identify three fraud categories. Key demographics and study variables were compared across fraud categories using chi-square/Fisher's Exact tests for categorical data and Kruskal-Wallis tests for continuous data (significance set at 0.05). Of the 1147 total survey entries, 837 (73%) completed at least 20% of the survey (814 completed all items). A total of 739 (88%) of the 837 completed surveys were classified as fraudulent. Among the 837 completed surveys, 333 (40%) had an AI-generated open-ended response and fast completion time (≤ 20 min) and 234 (28%) entries were flagged for all three of these indicators. Sociodemographic characteristics and HIV prevention outcomes were largely similar across bot-generated and human responses. Findings suggest that survey bots are a pervasive threat to online research and are effective at providing human-like responses. To protect data integrity and ensure the development of effective health policies and interventions, health science researchers should adopt comprehensive bot detection and prevention strategies.</p>","PeriodicalId":54492,"journal":{"name":"Research in Nursing & Health","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Nursing & Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/nur.70021","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NURSING","Score":null,"Total":0}

引用次数: 0

Abstract

Online survey research has become an increasingly popular and effective method in the social sciences for exploring and addressing health-related issues. However, the increasing prevalence of fraudulent activities, particularly survey bots, threatens data integrity and can compromise health research by generating misleading data. The purpose of this paper was to describe the implementation of bot detection strategies in an online survey with Latine sexual minority men (SMM). Eleven bot detection indicators, including AI-detection software for open-ended responses, were used in two approaches to differentiate bot-generated from human responses. In the first approach, bot detection indicators were applied stepwise to identify valid entries. In the second approach, a fraud detection algorithm was used to identify three fraud categories. Key demographics and study variables were compared across fraud categories using chi-square/Fisher's Exact tests for categorical data and Kruskal-Wallis tests for continuous data (significance set at 0.05). Of the 1147 total survey entries, 837 (73%) completed at least 20% of the survey (814 completed all items). A total of 739 (88%) of the 837 completed surveys were classified as fraudulent. Among the 837 completed surveys, 333 (40%) had an AI-generated open-ended response and fast completion time (≤ 20 min) and 234 (28%) entries were flagged for all three of these indicators. Sociodemographic characteristics and HIV prevention outcomes were largely similar across bot-generated and human responses. Findings suggest that survey bots are a pervasive threat to online research and are effective at providing human-like responses. To protect data integrity and ensure the development of effective health policies and interventions, health science researchers should adopt comprehensive bot detection and prevention strategies.

查看原文本刊更多论文

解决在线健康研究中的调查欺诈：拉丁裔性少数男性的案例研究。

在线调查研究已成为社会科学中探索和解决健康相关问题的一种日益流行和有效的方法。然而，越来越普遍的欺诈活动，特别是调查机器人，威胁到数据的完整性，并可能通过产生误导性数据而损害卫生研究。本文的目的是描述在拉丁性少数男性（SMM）在线调查中机器人检测策略的实施。11个机器人检测指标，包括用于开放式响应的人工智能检测软件，在两种方法中用于区分机器人生成的响应和人类响应。在第一种方法中，应用机器人检测指标逐步识别有效条目。在第二种方法中，使用欺诈检测算法来识别三种欺诈类别。对分类数据使用卡方/Fisher's精确检验，对连续数据使用Kruskal-Wallis检验（显著性设置为0.05），对不同欺诈类别的关键人口统计学和研究变量进行比较。在总共1147个调查条目中，837个（73%）完成了至少20%的调查（814个完成了所有项目）。在837份已完成的调查中，共有739份（88%）被列为欺诈。在完成的837项调查中，333项（40%）具有人工智能生成的开放式响应和快速完成时间（≤20分钟），234项（28%）的条目被标记为所有这三个指标。社会人口学特征和艾滋病毒预防结果在机器人和人类反应中基本相似。调查结果表明，调查机器人对在线研究是一种普遍的威胁，在提供类似人类的回答方面很有效。为了保护数据完整性并确保制定有效的卫生政策和干预措施，卫生科学研究人员应采取全面的僵尸检测和预防策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Research in Nursing & Health 医学-护理

CiteScore

3.90

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Research in Nursing & Health ( RINAH ) is a peer-reviewed general research journal devoted to publication of a wide range of research that will inform the practice of nursing and other health disciplines. The editors invite reports of research describing problems and testing interventions related to health phenomena, health care and self-care, clinical organization and administration; and the testing of research findings in practice. Research protocols are considered if funded in a peer-reviewed process by an agency external to the authors’ home institution and if the work is in progress. Papers on research methods and techniques are appropriate if they go beyond what is already generally available in the literature and include description of successful use of the method. Theory papers are accepted if each proposition is supported by research evidence. Systematic reviews of the literature are reviewed if PRISMA guidelines are followed. Letters to the editor commenting on published articles are welcome.