A Scoping Literature Review of Natural Language Processing Application to Safety Occurrence Reports

IF 1.8 Q3 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

Safety Pub Date : 2023-04-05 DOI:10.3390/safety9020022

John W. Ricketts, Dave Barry, Weisi Guo, Jonathan Pelham

{"title":"A Scoping Literature Review of Natural Language Processing Application to Safety Occurrence Reports","authors":"John W. Ricketts, Dave Barry, Weisi Guo, Jonathan Pelham","doi":"10.3390/safety9020022","DOIUrl":null,"url":null,"abstract":"Safety occurrence reports can contain valuable information on how incidents occur, revealing knowledge that can assist safety practitioners. This paper presents and discusses a literature review exploring how Natural Language Processing (NLP) has been applied to occurrence reports within safety-critical industries, informing further research on the topic and highlighting common challenges. Some of the uses of NLP include the ability for occurrence reports to be automatically classified against categories, and entities such as causes and consequences to be extracted from the text as well as the semantic searching of occurrence databases. The review revealed that machine learning models form the dominant method when applying NLP, although rule-based algorithms still provide a viable option for some entity extraction tasks. Recent advances in deep learning models such as Bidirectional Transformers for Language Understanding are now achieving a high accuracy while eliminating the need to substantially pre-process text. The construction of safety-themed datasets would be of benefit for the application of NLP to occurrence reporting, as this would allow the fine-tuning of current language models to safety tasks. An interesting approach is the use of topic modelling, which represents a shift away from the prescriptive classification taxonomies, splitting data into “topics”. Where many papers focus on the computational accuracy of models, they would also benefit from real-world trials to further inform usefulness. It is anticipated that NLP will soon become a mainstream tool used by safety practitioners to efficiently process and gain knowledge from safety-related text.","PeriodicalId":36827,"journal":{"name":"Safety","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Safety","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/safety9020022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 2

Abstract

Safety occurrence reports can contain valuable information on how incidents occur, revealing knowledge that can assist safety practitioners. This paper presents and discusses a literature review exploring how Natural Language Processing (NLP) has been applied to occurrence reports within safety-critical industries, informing further research on the topic and highlighting common challenges. Some of the uses of NLP include the ability for occurrence reports to be automatically classified against categories, and entities such as causes and consequences to be extracted from the text as well as the semantic searching of occurrence databases. The review revealed that machine learning models form the dominant method when applying NLP, although rule-based algorithms still provide a viable option for some entity extraction tasks. Recent advances in deep learning models such as Bidirectional Transformers for Language Understanding are now achieving a high accuracy while eliminating the need to substantially pre-process text. The construction of safety-themed datasets would be of benefit for the application of NLP to occurrence reporting, as this would allow the fine-tuning of current language models to safety tasks. An interesting approach is the use of topic modelling, which represents a shift away from the prescriptive classification taxonomies, splitting data into “topics”. Where many papers focus on the computational accuracy of models, they would also benefit from real-world trials to further inform usefulness. It is anticipated that NLP will soon become a mainstream tool used by safety practitioners to efficiently process and gain knowledge from safety-related text.

查看原文本刊更多论文

自然语言处理在安全事故报告中的应用综述

安全事故报告可以包含关于事故如何发生的有价值的信息，揭示可以帮助安全从业人员的知识。本文介绍并讨论了一篇文献综述，探讨了自然语言处理(NLP)如何应用于安全关键行业的事故报告，为该主题的进一步研究提供了信息，并突出了共同的挑战。NLP的一些用途包括根据类别对事件报告进行自动分类的能力，以及从文本中提取原因和后果等实体的能力，以及对事件数据库进行语义搜索的能力。回顾表明，机器学习模型是应用自然语言处理的主要方法，尽管基于规则的算法仍然为一些实体提取任务提供了可行的选择。深度学习模型的最新进展，如用于语言理解的双向变形器，现在实现了高精度，同时消除了对文本进行大量预处理的需要。安全主题数据集的构建将有利于NLP在事件报告中的应用，因为这将允许对当前的语言模型进行安全任务的微调。一个有趣的方法是使用主题建模，它代表了从规定性分类分类法的转变，将数据划分为“主题”。当许多论文关注模型的计算精度时，它们也将受益于现实世界的试验，以进一步提供有用的信息。预计NLP将很快成为安全从业者使用的主流工具，以有效地处理和获取与安全相关的文本知识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊