Work Orders - Value from Structureless Text in the Era of Digitisation

Erik Salo, David McMillan, R. Connor
{"title":"Work Orders - Value from Structureless Text in the Era of Digitisation","authors":"Erik Salo, David McMillan, R. Connor","doi":"10.2118/195788-MS","DOIUrl":null,"url":null,"abstract":"\n Free text and hand-written reports are losing ground to digitization fast, however many hours of effort are still lost across the industry to the manual creation and analysis of these data types. Work orders in particular contain valuable information from failure rates to asset health, but at the same time present operators with such analytical difficulties and lack of structure that many are missing out on the value completely. This research challenges the current mainstream practice of manual work order analysis by presenting a methodology fit for today’s context of efficiency and digitization.\n A prototype text mining software for work order analysis was developed and tested in a user-oriented approach in cooperation with industrial partners. The final prototype combines classical machine learning methods, such as hierarchical clustering, with the operator’s expert knowledge obtained via an active learning approach. A novel distance metric in this context was adapted from information-theoretical research to improve clustering performance.\n Using the prototype tool in a case study with real work order data, analytical effort for certain datasets was reduced by 90% - from two working weeks to a day. In addition, the active learning framework resulted in an approach that end users described as \"practical\" and \"intuitive\" during testing. An in-depth review was also conducted regarding the uncertainty of the results – a key factor for implementation in a decision-making context.\n The outcomes of this work showcase the potential of machine learning to drive the digitization of not only new installations, but also older assets, where as a result the large amount of unstructured historical data becomes an advantage rather than a hindrance. User testing results encourage a wider uptake of machine learning solutions in the industry, and particularly a shift towards more accessible in-house analytical capabilities.","PeriodicalId":113290,"journal":{"name":"Day 2 Wed, September 04, 2019","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Wed, September 04, 2019","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/195788-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Free text and hand-written reports are losing ground to digitization fast, however many hours of effort are still lost across the industry to the manual creation and analysis of these data types. Work orders in particular contain valuable information from failure rates to asset health, but at the same time present operators with such analytical difficulties and lack of structure that many are missing out on the value completely. This research challenges the current mainstream practice of manual work order analysis by presenting a methodology fit for today’s context of efficiency and digitization. A prototype text mining software for work order analysis was developed and tested in a user-oriented approach in cooperation with industrial partners. The final prototype combines classical machine learning methods, such as hierarchical clustering, with the operator’s expert knowledge obtained via an active learning approach. A novel distance metric in this context was adapted from information-theoretical research to improve clustering performance. Using the prototype tool in a case study with real work order data, analytical effort for certain datasets was reduced by 90% - from two working weeks to a day. In addition, the active learning framework resulted in an approach that end users described as "practical" and "intuitive" during testing. An in-depth review was also conducted regarding the uncertainty of the results – a key factor for implementation in a decision-making context. The outcomes of this work showcase the potential of machine learning to drive the digitization of not only new installations, but also older assets, where as a result the large amount of unstructured historical data becomes an advantage rather than a hindrance. User testing results encourage a wider uptake of machine learning solutions in the industry, and particularly a shift towards more accessible in-house analytical capabilities.
工单——数字化时代无结构文本的价值
自由文本和手写报告正迅速被数字化所取代,然而,在整个行业中,手工创建和分析这些数据类型仍然耗费了大量的时间。特别是作业订单包含了从故障率到资产健康状况的宝贵信息,但同时也给作业者带来了分析困难和缺乏结构,许多人完全错过了价值。本研究通过提出一种适合当今效率和数字化背景的方法,挑战了当前手工工单分析的主流实践。与工业伙伴合作,以面向用户的方式开发并测试了用于工单分析的原型文本挖掘软件。最终的原型结合了经典的机器学习方法,如分层聚类,以及通过主动学习方法获得的操作员专业知识。在这种情况下,一种新的距离度量从信息理论研究中改编而来,以提高聚类性能。在实际工作订单数据的案例研究中使用原型工具,对某些数据集的分析工作减少了90%——从两个工作周减少到一天。此外,主动学习框架产生了一种最终用户在测试期间描述为“实用”和“直观”的方法。还对结果的不确定性进行了深入审查- -这是在决策范围内执行的一个关键因素。这项工作的结果展示了机器学习的潜力,不仅可以推动新设备的数字化,还可以推动旧资产的数字化,因此,大量的非结构化历史数据成为一种优势,而不是障碍。用户测试结果鼓励行业更广泛地采用机器学习解决方案,特别是向更易于访问的内部分析能力转变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信