{"title":"评估在线症状检查器对危险信号的使用情况。","authors":"Shailen Sutaria, Delanjathan Devakumar, Poppy Mallinson, Sanjay Kinra, Tamer T Malak, Andras Meczner","doi":"10.1186/s12913-025-13353-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Online Symptom Checkers (OSCs) are digital health tools providing triage, diagnostic, and self-care advice based on user reported symptoms. Amidst global trends of increasing demand and workforce shortages, OSCs have the potential to alleviate primary care workload. However, their ability to seek red flag symptoms, a critical marker of a safe consultation in primary care, remains unexplored. Using clinical vignettes, this study evaluates OSCs' performance in seeking red flag symptoms compared to Primary Care Physicians (PCPs).</p><p><strong>Methods: </strong>Four OSCs (Ada, Babylon, Symptomate, Healthily) were evaluated using 51 clinical vignettes. Two standard setters used guidelines to determine which vignettes required emergency triage and identified the relevant red flags symptoms for the remaining vignettes. Two laypersons entered data from vignettes into OSCs and outputs were collected following a standardised form. The same vignettes were independently assessed by PCPs to compare triage accuracy and red flag identification. Summary statistics and 95% confidence intervals were calculated using Wilson Score intervals, and Fisher's exact test was used to compare performance between OSCs and PCPs.</p><p><strong>Results: </strong>Of the 51 clinical vignettes, standard setters determined 14 to require emergency triage and the remaining 37 vignettes suitable for primary care triage. Of the primary care triaged vignettes, standard setters identified a total of 77 relevant red flag symptoms to be sought. Of the 14 emergency vignettes, PCPs correctly triaged 85.7% (95% CI: 74.3-92.6%) of cases compared to OSCs 76.9% (95% CI: 59.3-87.9%), with no statistically significant difference (p = 0.299). Specificity, the proportion of correctly triaged primary care vignettes, PCPs performed significantly better compared to OSCs, 91.9% (95%CI 78.9-97.0%) vs. 83.3% (95%CI 68.1-91.9%), p = 0.024.</p><p><strong>Conclusions: </strong>OSCs demonstrated comparable ability to appropriately triage clinical vignettes requiring emergency triage as PCPs, however, were less specific, triaging more primary care vignettes as emergency. OSCs do not seek the majority of red flags. This raises concerns about their safety and effectiveness in primary care. OSCs developers should focus on improving OSCs' red flag coverage to ensure safe integration into primary care settings.</p>","PeriodicalId":9012,"journal":{"name":"BMC Health Services Research","volume":"25 1","pages":"1263"},"PeriodicalIF":3.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486864/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluating the use of red flags by online symptom checkers.\",\"authors\":\"Shailen Sutaria, Delanjathan Devakumar, Poppy Mallinson, Sanjay Kinra, Tamer T Malak, Andras Meczner\",\"doi\":\"10.1186/s12913-025-13353-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Online Symptom Checkers (OSCs) are digital health tools providing triage, diagnostic, and self-care advice based on user reported symptoms. Amidst global trends of increasing demand and workforce shortages, OSCs have the potential to alleviate primary care workload. However, their ability to seek red flag symptoms, a critical marker of a safe consultation in primary care, remains unexplored. Using clinical vignettes, this study evaluates OSCs' performance in seeking red flag symptoms compared to Primary Care Physicians (PCPs).</p><p><strong>Methods: </strong>Four OSCs (Ada, Babylon, Symptomate, Healthily) were evaluated using 51 clinical vignettes. Two standard setters used guidelines to determine which vignettes required emergency triage and identified the relevant red flags symptoms for the remaining vignettes. Two laypersons entered data from vignettes into OSCs and outputs were collected following a standardised form. The same vignettes were independently assessed by PCPs to compare triage accuracy and red flag identification. Summary statistics and 95% confidence intervals were calculated using Wilson Score intervals, and Fisher's exact test was used to compare performance between OSCs and PCPs.</p><p><strong>Results: </strong>Of the 51 clinical vignettes, standard setters determined 14 to require emergency triage and the remaining 37 vignettes suitable for primary care triage. Of the primary care triaged vignettes, standard setters identified a total of 77 relevant red flag symptoms to be sought. Of the 14 emergency vignettes, PCPs correctly triaged 85.7% (95% CI: 74.3-92.6%) of cases compared to OSCs 76.9% (95% CI: 59.3-87.9%), with no statistically significant difference (p = 0.299). Specificity, the proportion of correctly triaged primary care vignettes, PCPs performed significantly better compared to OSCs, 91.9% (95%CI 78.9-97.0%) vs. 83.3% (95%CI 68.1-91.9%), p = 0.024.</p><p><strong>Conclusions: </strong>OSCs demonstrated comparable ability to appropriately triage clinical vignettes requiring emergency triage as PCPs, however, were less specific, triaging more primary care vignettes as emergency. OSCs do not seek the majority of red flags. This raises concerns about their safety and effectiveness in primary care. OSCs developers should focus on improving OSCs' red flag coverage to ensure safe integration into primary care settings.</p>\",\"PeriodicalId\":9012,\"journal\":{\"name\":\"BMC Health Services Research\",\"volume\":\"25 1\",\"pages\":\"1263\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486864/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12913-025-13353-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12913-025-13353-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Evaluating the use of red flags by online symptom checkers.
Background: Online Symptom Checkers (OSCs) are digital health tools providing triage, diagnostic, and self-care advice based on user reported symptoms. Amidst global trends of increasing demand and workforce shortages, OSCs have the potential to alleviate primary care workload. However, their ability to seek red flag symptoms, a critical marker of a safe consultation in primary care, remains unexplored. Using clinical vignettes, this study evaluates OSCs' performance in seeking red flag symptoms compared to Primary Care Physicians (PCPs).
Methods: Four OSCs (Ada, Babylon, Symptomate, Healthily) were evaluated using 51 clinical vignettes. Two standard setters used guidelines to determine which vignettes required emergency triage and identified the relevant red flags symptoms for the remaining vignettes. Two laypersons entered data from vignettes into OSCs and outputs were collected following a standardised form. The same vignettes were independently assessed by PCPs to compare triage accuracy and red flag identification. Summary statistics and 95% confidence intervals were calculated using Wilson Score intervals, and Fisher's exact test was used to compare performance between OSCs and PCPs.
Results: Of the 51 clinical vignettes, standard setters determined 14 to require emergency triage and the remaining 37 vignettes suitable for primary care triage. Of the primary care triaged vignettes, standard setters identified a total of 77 relevant red flag symptoms to be sought. Of the 14 emergency vignettes, PCPs correctly triaged 85.7% (95% CI: 74.3-92.6%) of cases compared to OSCs 76.9% (95% CI: 59.3-87.9%), with no statistically significant difference (p = 0.299). Specificity, the proportion of correctly triaged primary care vignettes, PCPs performed significantly better compared to OSCs, 91.9% (95%CI 78.9-97.0%) vs. 83.3% (95%CI 68.1-91.9%), p = 0.024.
Conclusions: OSCs demonstrated comparable ability to appropriately triage clinical vignettes requiring emergency triage as PCPs, however, were less specific, triaging more primary care vignettes as emergency. OSCs do not seek the majority of red flags. This raises concerns about their safety and effectiveness in primary care. OSCs developers should focus on improving OSCs' red flag coverage to ensure safe integration into primary care settings.
期刊介绍:
BMC Health Services Research is an open access, peer-reviewed journal that considers articles on all aspects of health services research, including delivery of care, management of health services, assessment of healthcare needs, measurement of outcomes, allocation of healthcare resources, evaluation of different health markets and health services organizations, international comparative analysis of health systems, health economics and the impact of health policies and regulations.