Baekgyu Kwon , Junho Kim , Hyunoh Lee , Hyo-Won Suh , Duhwan Mun
{"title":"利用自然语言处理技术,从非结构化设计指南中构建设计要求知识库","authors":"Baekgyu Kwon , Junho Kim , Hyunoh Lee , Hyo-Won Suh , Duhwan Mun","doi":"10.1016/j.compind.2024.104100","DOIUrl":null,"url":null,"abstract":"<div><p>In the manufacturing industry, unstructured documents such as design guidelines, regulatory documents, and failure cases are essential for product development. However, due to the large volume and frequent revisions of these documents, designers often find it difficult to keep up to date with the latest content. This study presents a method for analyzing the characteristics of unstructured design guidelines and automatically constructing a knowledgebase of design requirements from them. A knowledgebase is structured data that a computer can understand, and that can be used to assist designers in the design process. The knowledgebase is constructed using the sections of the document, including design variables and design requirements. The construction process involves pre-processing the documents, extracting information using natural language processing models, and generating a knowledgebase using predefined rules. A requirements knowledgebase was experimentally constructed from a standard document on the general requirements for the design of pressure vessels (American Society of Mechanical Engineers Section VIII Division 1) using the proposed method. In the experiment, the accuracy of information extraction was 86.3 %, and the generation process took 3 min and 50 s. Thus, the proposed method eliminates the need for specialized training of deep learning models and can be applied to various design guideline documents with simple modifications to the design vocabulary and rules. The knowledgebase has applications in design validation, and is expected to enhance the efficiency of the product development process and contribute to reducing the overall development timeline.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104100"},"PeriodicalIF":8.2000,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Construction of design requirements knowledgebase from unstructured design guidelines using natural language processing\",\"authors\":\"Baekgyu Kwon , Junho Kim , Hyunoh Lee , Hyo-Won Suh , Duhwan Mun\",\"doi\":\"10.1016/j.compind.2024.104100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the manufacturing industry, unstructured documents such as design guidelines, regulatory documents, and failure cases are essential for product development. However, due to the large volume and frequent revisions of these documents, designers often find it difficult to keep up to date with the latest content. This study presents a method for analyzing the characteristics of unstructured design guidelines and automatically constructing a knowledgebase of design requirements from them. A knowledgebase is structured data that a computer can understand, and that can be used to assist designers in the design process. The knowledgebase is constructed using the sections of the document, including design variables and design requirements. The construction process involves pre-processing the documents, extracting information using natural language processing models, and generating a knowledgebase using predefined rules. A requirements knowledgebase was experimentally constructed from a standard document on the general requirements for the design of pressure vessels (American Society of Mechanical Engineers Section VIII Division 1) using the proposed method. In the experiment, the accuracy of information extraction was 86.3 %, and the generation process took 3 min and 50 s. Thus, the proposed method eliminates the need for specialized training of deep learning models and can be applied to various design guideline documents with simple modifications to the design vocabulary and rules. The knowledgebase has applications in design validation, and is expected to enhance the efficiency of the product development process and contribute to reducing the overall development timeline.</p></div>\",\"PeriodicalId\":55219,\"journal\":{\"name\":\"Computers in Industry\",\"volume\":\"159 \",\"pages\":\"Article 104100\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2024-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in Industry\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0166361524000289\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000289","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
摘要
在制造业中,设计指南、监管文件和故障案例等非结构化文档对产品开发至关重要。然而,由于这些文件数量庞大、修订频繁,设计人员往往很难及时了解最新内容。本研究提出了一种方法,用于分析非结构化设计指南的特点,并从中自动构建设计要求知识库。知识库是计算机能够理解的结构化数据,可用于在设计过程中协助设计人员。知识库是利用文件的各个部分构建的,包括设计变量和设计要求。构建过程包括预处理文档、使用自然语言处理模型提取信息,以及使用预定义规则生成知识库。使用所提出的方法,从压力容器设计一般要求的标准文件(美国机械工程师协会第 VIII 章第 1 节)中构建了一个需求知识库。在实验中,信息提取的准确率为 86.3%,生成过程耗时 3 分 50 秒。因此,所提出的方法无需对深度学习模型进行专门训练,只需对设计词汇和规则进行简单修改,即可应用于各种设计指南文档。该知识库可应用于设计验证,有望提高产品开发流程的效率,并有助于缩短整体开发时间。
Construction of design requirements knowledgebase from unstructured design guidelines using natural language processing
In the manufacturing industry, unstructured documents such as design guidelines, regulatory documents, and failure cases are essential for product development. However, due to the large volume and frequent revisions of these documents, designers often find it difficult to keep up to date with the latest content. This study presents a method for analyzing the characteristics of unstructured design guidelines and automatically constructing a knowledgebase of design requirements from them. A knowledgebase is structured data that a computer can understand, and that can be used to assist designers in the design process. The knowledgebase is constructed using the sections of the document, including design variables and design requirements. The construction process involves pre-processing the documents, extracting information using natural language processing models, and generating a knowledgebase using predefined rules. A requirements knowledgebase was experimentally constructed from a standard document on the general requirements for the design of pressure vessels (American Society of Mechanical Engineers Section VIII Division 1) using the proposed method. In the experiment, the accuracy of information extraction was 86.3 %, and the generation process took 3 min and 50 s. Thus, the proposed method eliminates the need for specialized training of deep learning models and can be applied to various design guideline documents with simple modifications to the design vocabulary and rules. The knowledgebase has applications in design validation, and is expected to enhance the efficiency of the product development process and contribute to reducing the overall development timeline.
期刊介绍:
The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that:
• Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry;
• Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry;
• Foster connections or integrations across diverse application areas of ICT in industry.