Construction of design requirements knowledgebase from unstructured design guidelines using natural language processing

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Baekgyu Kwon , Junho Kim , Hyunoh Lee , Hyo-Won Suh , Duhwan Mun
{"title":"Construction of design requirements knowledgebase from unstructured design guidelines using natural language processing","authors":"Baekgyu Kwon ,&nbsp;Junho Kim ,&nbsp;Hyunoh Lee ,&nbsp;Hyo-Won Suh ,&nbsp;Duhwan Mun","doi":"10.1016/j.compind.2024.104100","DOIUrl":null,"url":null,"abstract":"<div><p>In the manufacturing industry, unstructured documents such as design guidelines, regulatory documents, and failure cases are essential for product development. However, due to the large volume and frequent revisions of these documents, designers often find it difficult to keep up to date with the latest content. This study presents a method for analyzing the characteristics of unstructured design guidelines and automatically constructing a knowledgebase of design requirements from them. A knowledgebase is structured data that a computer can understand, and that can be used to assist designers in the design process. The knowledgebase is constructed using the sections of the document, including design variables and design requirements. The construction process involves pre-processing the documents, extracting information using natural language processing models, and generating a knowledgebase using predefined rules. A requirements knowledgebase was experimentally constructed from a standard document on the general requirements for the design of pressure vessels (American Society of Mechanical Engineers Section VIII Division 1) using the proposed method. In the experiment, the accuracy of information extraction was 86.3 %, and the generation process took 3 min and 50 s. Thus, the proposed method eliminates the need for specialized training of deep learning models and can be applied to various design guideline documents with simple modifications to the design vocabulary and rules. The knowledgebase has applications in design validation, and is expected to enhance the efficiency of the product development process and contribute to reducing the overall development timeline.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104100"},"PeriodicalIF":8.2000,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000289","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

In the manufacturing industry, unstructured documents such as design guidelines, regulatory documents, and failure cases are essential for product development. However, due to the large volume and frequent revisions of these documents, designers often find it difficult to keep up to date with the latest content. This study presents a method for analyzing the characteristics of unstructured design guidelines and automatically constructing a knowledgebase of design requirements from them. A knowledgebase is structured data that a computer can understand, and that can be used to assist designers in the design process. The knowledgebase is constructed using the sections of the document, including design variables and design requirements. The construction process involves pre-processing the documents, extracting information using natural language processing models, and generating a knowledgebase using predefined rules. A requirements knowledgebase was experimentally constructed from a standard document on the general requirements for the design of pressure vessels (American Society of Mechanical Engineers Section VIII Division 1) using the proposed method. In the experiment, the accuracy of information extraction was 86.3 %, and the generation process took 3 min and 50 s. Thus, the proposed method eliminates the need for specialized training of deep learning models and can be applied to various design guideline documents with simple modifications to the design vocabulary and rules. The knowledgebase has applications in design validation, and is expected to enhance the efficiency of the product development process and contribute to reducing the overall development timeline.

利用自然语言处理技术,从非结构化设计指南中构建设计要求知识库
在制造业中,设计指南、监管文件和故障案例等非结构化文档对产品开发至关重要。然而,由于这些文件数量庞大、修订频繁,设计人员往往很难及时了解最新内容。本研究提出了一种方法,用于分析非结构化设计指南的特点,并从中自动构建设计要求知识库。知识库是计算机能够理解的结构化数据,可用于在设计过程中协助设计人员。知识库是利用文件的各个部分构建的,包括设计变量和设计要求。构建过程包括预处理文档、使用自然语言处理模型提取信息,以及使用预定义规则生成知识库。使用所提出的方法,从压力容器设计一般要求的标准文件(美国机械工程师协会第 VIII 章第 1 节)中构建了一个需求知识库。在实验中,信息提取的准确率为 86.3%,生成过程耗时 3 分 50 秒。因此,所提出的方法无需对深度学习模型进行专门训练,只需对设计词汇和规则进行简单修改,即可应用于各种设计指南文档。该知识库可应用于设计验证,有望提高产品开发流程的效率,并有助于缩短整体开发时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信