{"title":"Generative AI in peer review process for occupational health.","authors":"G H Lim, M L Tan, V C W Hoe, D Koh","doi":"10.1093/occmed/kqaf051","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Generative Artificial Intelligence (AI) tools in academic writing can augment and speed up the proofing process by improving sections of the manuscript. This was the first known instance where the effectiveness and efficiency of Generative AI were quantified.</p><p><strong>Aims: </strong>To determine the effectiveness and efficiency of these tools in providing feedback and recommendations to the first drafts of eight published occupational health papers.</p><p><strong>Methods: </strong>To assess effectiveness, manuscripts were reviewed by Microsoft Copilot, ChatGPT (GPT-3.5), Google Gemini 1.0 and five human reviewers in February 2024. Anonymized reviews were scored by two expert panel members using a self-developed structured scoring system. The quality of feedback was rated on its relevance, completeness, accuracy and ability to identify errors and provide constructive feedback. The quality of recommendations was rated on relevance, completeness and accuracy. Efficiency was assessed via the time taken to complete each review. The mean, standard deviation (SD) and level of significance of any differences among the parameters were obtained.</p><p><strong>Results: </strong>Generative AI tools were significantly more effective (3.44, SD 0.77, P < 0.001) than human reviewers in providing feedback, while human reviewers performed significantly better (3.36, SD 0.71, P < 0.01) in providing recommendations. Generative AI tools were significantly more time-efficient and had the advantage of being always available. However, time/effort was required to verify the output for fictitious content.</p><p><strong>Conclusions: </strong>The utilization of Generative AI would improve the speed and accuracy of improving the manuscript prior to publication, leading to greater efficiencies in the dissemination of knowledge to the occupational health community.</p>","PeriodicalId":520727,"journal":{"name":"Occupational medicine (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Occupational medicine (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/occmed/kqaf051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Generative Artificial Intelligence (AI) tools in academic writing can augment and speed up the proofing process by improving sections of the manuscript. This was the first known instance where the effectiveness and efficiency of Generative AI were quantified.
Aims: To determine the effectiveness and efficiency of these tools in providing feedback and recommendations to the first drafts of eight published occupational health papers.
Methods: To assess effectiveness, manuscripts were reviewed by Microsoft Copilot, ChatGPT (GPT-3.5), Google Gemini 1.0 and five human reviewers in February 2024. Anonymized reviews were scored by two expert panel members using a self-developed structured scoring system. The quality of feedback was rated on its relevance, completeness, accuracy and ability to identify errors and provide constructive feedback. The quality of recommendations was rated on relevance, completeness and accuracy. Efficiency was assessed via the time taken to complete each review. The mean, standard deviation (SD) and level of significance of any differences among the parameters were obtained.
Results: Generative AI tools were significantly more effective (3.44, SD 0.77, P < 0.001) than human reviewers in providing feedback, while human reviewers performed significantly better (3.36, SD 0.71, P < 0.01) in providing recommendations. Generative AI tools were significantly more time-efficient and had the advantage of being always available. However, time/effort was required to verify the output for fictitious content.
Conclusions: The utilization of Generative AI would improve the speed and accuracy of improving the manuscript prior to publication, leading to greater efficiencies in the dissemination of knowledge to the occupational health community.