Alejandro García-Rudolph, David Sanchez-Pinsach, Javier Remacha, Sheila Patricio, Eloy Opisso
{"title":"ChatGPT as a rising force: Can AI bridge information gaps in Occupational Risk Prevention?","authors":"Alejandro García-Rudolph, David Sanchez-Pinsach, Javier Remacha, Sheila Patricio, Eloy Opisso","doi":"10.1177/10519815251348355","DOIUrl":null,"url":null,"abstract":"<p><p>BackgroundLack of information is a critical challenge in occupational health. With over 180 million users, ChatGPT has become a prominent trend, swiftly addressing a wide array of queries, yet it critically needs validation in occupational health.ObjectiveThis study evaluated GPT-3.5 (free version) and GPT-4 (paid version) on their ability to respond to Occupational Risk Prevention formal multiple-choice questions.MethodsA total of 303 questions were assessed, categorized across four levels of complexity-task-specific, national, European, and global-within various Spanish regions.ResultsGPT-3.5 achieved an overall accuracy of 56.8%, while GPT-4 reached 73.9% (p < 0.001). GPT-3.5 showed particularly limited performance on domain-specific content. Both models shared similar error patterns, with incorrect response rates ranging from 18-24% across regions.ConclusionDespite GPT-4's improved performance, both models display notable limitations in occupational health applications. To enhance reliability, four strategies are proposed: formal validation, continuous training, error analysis, and regional adaptation.</p>","PeriodicalId":51373,"journal":{"name":"Work-A Journal of Prevention Assessment & Rehabilitation","volume":" ","pages":"10519815251348355"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Work-A Journal of Prevention Assessment & Rehabilitation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/10519815251348355","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
BackgroundLack of information is a critical challenge in occupational health. With over 180 million users, ChatGPT has become a prominent trend, swiftly addressing a wide array of queries, yet it critically needs validation in occupational health.ObjectiveThis study evaluated GPT-3.5 (free version) and GPT-4 (paid version) on their ability to respond to Occupational Risk Prevention formal multiple-choice questions.MethodsA total of 303 questions were assessed, categorized across four levels of complexity-task-specific, national, European, and global-within various Spanish regions.ResultsGPT-3.5 achieved an overall accuracy of 56.8%, while GPT-4 reached 73.9% (p < 0.001). GPT-3.5 showed particularly limited performance on domain-specific content. Both models shared similar error patterns, with incorrect response rates ranging from 18-24% across regions.ConclusionDespite GPT-4's improved performance, both models display notable limitations in occupational health applications. To enhance reliability, four strategies are proposed: formal validation, continuous training, error analysis, and regional adaptation.
期刊介绍:
WORK: A Journal of Prevention, Assessment & Rehabilitation is an interdisciplinary, international journal which publishes high quality peer-reviewed manuscripts covering the entire scope of the occupation of work. The journal''s subtitle has been deliberately laid out: The first goal is the prevention of illness, injury, and disability. When this goal is not achievable, the attention focuses on assessment to design client-centered intervention, rehabilitation, treatment, or controls that use scientific evidence to support best practice.