Generative AI in peer review process for occupational health.

G H Lim, M L Tan, V C W Hoe, D Koh
{"title":"Generative AI in peer review process for occupational health.","authors":"G H Lim, M L Tan, V C W Hoe, D Koh","doi":"10.1093/occmed/kqaf051","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Generative Artificial Intelligence (AI) tools in academic writing can augment and speed up the proofing process by improving sections of the manuscript. This was the first known instance where the effectiveness and efficiency of Generative AI were quantified.</p><p><strong>Aims: </strong>To determine the effectiveness and efficiency of these tools in providing feedback and recommendations to the first drafts of eight published occupational health papers.</p><p><strong>Methods: </strong>To assess effectiveness, manuscripts were reviewed by Microsoft Copilot, ChatGPT (GPT-3.5), Google Gemini 1.0 and five human reviewers in February 2024. Anonymized reviews were scored by two expert panel members using a self-developed structured scoring system. The quality of feedback was rated on its relevance, completeness, accuracy and ability to identify errors and provide constructive feedback. The quality of recommendations was rated on relevance, completeness and accuracy. Efficiency was assessed via the time taken to complete each review. The mean, standard deviation (SD) and level of significance of any differences among the parameters were obtained.</p><p><strong>Results: </strong>Generative AI tools were significantly more effective (3.44, SD 0.77, P < 0.001) than human reviewers in providing feedback, while human reviewers performed significantly better (3.36, SD 0.71, P < 0.01) in providing recommendations. Generative AI tools were significantly more time-efficient and had the advantage of being always available. However, time/effort was required to verify the output for fictitious content.</p><p><strong>Conclusions: </strong>The utilization of Generative AI would improve the speed and accuracy of improving the manuscript prior to publication, leading to greater efficiencies in the dissemination of knowledge to the occupational health community.</p>","PeriodicalId":520727,"journal":{"name":"Occupational medicine (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Occupational medicine (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/occmed/kqaf051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Generative Artificial Intelligence (AI) tools in academic writing can augment and speed up the proofing process by improving sections of the manuscript. This was the first known instance where the effectiveness and efficiency of Generative AI were quantified.

Aims: To determine the effectiveness and efficiency of these tools in providing feedback and recommendations to the first drafts of eight published occupational health papers.

Methods: To assess effectiveness, manuscripts were reviewed by Microsoft Copilot, ChatGPT (GPT-3.5), Google Gemini 1.0 and five human reviewers in February 2024. Anonymized reviews were scored by two expert panel members using a self-developed structured scoring system. The quality of feedback was rated on its relevance, completeness, accuracy and ability to identify errors and provide constructive feedback. The quality of recommendations was rated on relevance, completeness and accuracy. Efficiency was assessed via the time taken to complete each review. The mean, standard deviation (SD) and level of significance of any differences among the parameters were obtained.

Results: Generative AI tools were significantly more effective (3.44, SD 0.77, P < 0.001) than human reviewers in providing feedback, while human reviewers performed significantly better (3.36, SD 0.71, P < 0.01) in providing recommendations. Generative AI tools were significantly more time-efficient and had the advantage of being always available. However, time/effort was required to verify the output for fictitious content.

Conclusions: The utilization of Generative AI would improve the speed and accuracy of improving the manuscript prior to publication, leading to greater efficiencies in the dissemination of knowledge to the occupational health community.

生成人工智能在职业健康同行评议过程中的应用。
背景:学术写作中的生成式人工智能(AI)工具可以通过改进手稿的部分来增强和加快打样过程。这是已知的第一个量化生成人工智能的有效性和效率的例子。目的:确定这些工具在向八篇已发表的职业健康论文初稿提供反馈和建议方面的有效性和效率。方法:采用Microsoft Copilot、ChatGPT (GPT-3.5)、谷歌Gemini 1.0和5名人工审稿人于2024年2月对稿件进行评审,以评估其有效性。匿名评论由两名专家小组成员使用自行开发的结构化评分系统进行评分。反馈的质量是根据其相关性、完整性、准确性和识别错误和提供建设性反馈的能力来评定的。推荐的质量根据相关性、完整性和准确性进行评级。通过完成每次审查所花费的时间来评估效率。得到各参数间差异的均值、标准差和显著性水平。结果:生成式AI工具在提供反馈方面显著优于人工审稿人(3.44,SD 0.77, P < 0.001),而人工审稿人在提供推荐方面显著优于人工审稿人(3.36,SD 0.71, P < 0.01)。生成式人工智能工具的时间效率更高,并且具有始终可用的优势。但是,验证虚构内容的输出需要时间/精力。结论:使用生成式人工智能将提高出版前修改稿件的速度和准确性,从而提高向职业卫生界传播知识的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信