Can Large Language Models (LLMs) Act as Virtual Safety Officers?
S. M. Supundrika Subasinghe, Simon G. Gersib, Thomas M. Frueh and Neal P. Mankad*
Journal of chemical health & safety, 32 (1), pp. 39–47. Published 2024-12-26. DOI: 10.1021/acs.chas.4c00097
https://pubs.acs.org/doi/10.1021/acs.chas.4c00097
Citation count: 0
Abstract
This study examines how reliably artificial intelligence (AI) systems, specifically the large language models (LLMs) ChatGPT, Copilot, and Gemini, provide accurate lab safety advice, a critical need in high-risk environments. We evaluated LLM performance on several chemical safety queries relevant to academic chemistry laboratories against five criteria: accuracy, relevance, clarity, completeness, and engagement. While all the LLMs tested generally delivered clear and accurate guidance, some shortcomings were identified, raising concerns about reliability during safety emergencies or for nonexpert users. Despite these issues, the findings suggest that, with further refinement, AI has the potential to become a valuable lab safety tool that complements a human laboratory safety officer.