BERT for Aviation Text Classification

Xiao Jing, Akul Chennakesavan, Chetan Chandra, Mayank V. Bendarkar, Michelle R. Kirby, D. Mavris
{"title":"BERT for Aviation Text Classification","authors":"Xiao Jing, Akul Chennakesavan, Chetan Chandra, Mayank V. Bendarkar, Michelle R. Kirby, D. Mavris","doi":"10.2514/6.2023-3438","DOIUrl":null,"url":null,"abstract":"The advent of transformer-based models pre-trained on large-scale text corpora has rev-olutionized Natural Language Processing (NLP) in recent years. Models such as BERT (Bidirectional Encoder Representations from Transformers) offer powerful tools for understanding contextual information and have achieved impressive results in numerous language understanding tasks. However, their application in the aviation domain remains relatively unexplored. This study discusses the challenges of applying multi-label classification problems on aviation text data. A custom aviation domain specific BERT model (Aviation-BERT) is compared against BERT-base-uncased for anomaly event classification in the Aviation Safety Reporting System (ASRS) data. Aviation-BERT is shown to have superior performance based on multiple metrics. By focusing on the potential of NLP in advancing complex aviation safety report analysis, the present work offers a comprehensive evaluation of BERT on aviation domain datasets and discusses its strengths and weaknesses. This research highlights the significance of domain-specific NLP models in improving the accuracy and efficiency of safety report classification and analysis in the aviation industry.","PeriodicalId":403570,"journal":{"name":"AIAA AVIATION 2023 Forum","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIAA AVIATION 2023 Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2514/6.2023-3438","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The advent of transformer-based models pre-trained on large-scale text corpora has rev-olutionized Natural Language Processing (NLP) in recent years. Models such as BERT (Bidirectional Encoder Representations from Transformers) offer powerful tools for understanding contextual information and have achieved impressive results in numerous language understanding tasks. However, their application in the aviation domain remains relatively unexplored. This study discusses the challenges of applying multi-label classification problems on aviation text data. A custom aviation domain specific BERT model (Aviation-BERT) is compared against BERT-base-uncased for anomaly event classification in the Aviation Safety Reporting System (ASRS) data. Aviation-BERT is shown to have superior performance based on multiple metrics. By focusing on the potential of NLP in advancing complex aviation safety report analysis, the present work offers a comprehensive evaluation of BERT on aviation domain datasets and discusses its strengths and weaknesses. This research highlights the significance of domain-specific NLP models in improving the accuracy and efficiency of safety report classification and analysis in the aviation industry.
BERT为航空文本分类
近年来,在大规模文本语料库上进行预训练的基于变换的模型的出现彻底改变了自然语言处理(NLP)。BERT(来自变形金刚的双向编码器表示)等模型为理解上下文信息提供了强大的工具,并在许多语言理解任务中取得了令人印象深刻的结果。然而,它们在航空领域的应用仍然相对未被探索。本文讨论了在航空文本数据中应用多标签分类问题所面临的挑战。将航空安全报告系统(ASRS)数据中的异常事件分类与航空领域特定的自定义BERT模型(aviation -BERT)进行比较。基于多个指标,Aviation-BERT显示出优越的性能。通过关注NLP在推进复杂航空安全报告分析方面的潜力,本研究对BERT在航空领域数据集上的应用进行了全面评估,并讨论了其优缺点。本研究强调了特定领域的NLP模型在提高航空业安全报告分类和分析的准确性和效率方面的重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信