{"title":"Using Knowledge Graph Embedding for Fault Detection - A Case Study in Electric Vehicle Parts Assembly","authors":"Ziad Kobti, Joseph El-Ghaname","doi":"10.32473/flairs.36.133373","DOIUrl":null,"url":null,"abstract":"Automotive manufacturers are under stressful timelines as they shift their focus from internal combustion engines (ICE) to electric (EV) and hybrid-electric vehicles (HEV). The demand for this rapid change is crucial to meet a growing consumer market. New manufacturing challenges coupled with rapid change can lead to substantial safety risks for consumers as well as financial liability for automakers, especially when recalls happen. The resulting misplacement, misalignment, or defective assembly of any of the components or connectors can result in critical or even fatal outcomes for consumers. Recent findings reported by CNBC revealed that the shift to electric vehicles had cost automakers billions of dollars (Kolodny 2022). The cost of recalling an EV far outweighs that of an ICE. For instance, the Ford Kuga plug-in HEV had re-calls costs of about $19,000 per vehicle, in contrast to a typical ICE vehicle recall that averages around $500 per vehicle (Isidore and Vales-Dapena 2022). Furthermore, the EV recall rate has been higher. For instance, China’s EV recall rate was approximately 6.9% of its total sales volume (Hao et al. 2021).Automakers are highly motivated to prevent automotive recalls by implementing and employing several preventative measures. IoT sensor-based fault detection systems, as well as those with camera capabilities, have been used to detect defects during production and assembly processes. Industry 4.0 standards (Garofalo et al. 2022) have been adopted, particularly when companies employ an autonomous assembly process.A critical issue in vision or sensor-based fault detection systems is their limitations, where they can only analyze and observe end components without analyzing the relationships and possible underlying connections with other components. For instance, these relationships can reveal whether a given component is missing or is connected correctly to another component. Simply relying on machine vision examining components in isolation, especially in uncontrolled manufacturing environments, becomes difficult and reliable, not to mention the extremely de-manding computational power needed for vision processing.The motivation of this research work is to present an alternative perspective that employs a collective view of components, represented as a networked graph, particularly a knowledge graph (KG) that we hypothesize its ability to be effective in analyzing data in the search for faults.KGs are a collection of real-world fact triplets of the structured form (head, relation, tail) (Hogan et al. 2022). Fundamentally, KGs can be expressed as a graph where nodes represent components or sub-components, and edges indicate a relationship between the two adjacent components. Hence, KG can be used to effectively represent and map interconnected components during and after manufacturing. Researchers have demonstrated the usefulness of Knowledge Graph Embedding (KGE) as a potential solution for automotive fault detection, and they have used it to advance their autonomous driving solutions (Bosch Global 2022).This research aims at building KGs and testing their effectiveness in detecting faults in a custom dataset. We implement a KG Completion (KGC) algorithm and compare different KG Embedding (KGE) models. Furthermore, we measure and compare the Mean Reciprocal Rank (MRR) and Hits@K to evaluate the algorithm based on various KGE approaches and models. Our findings from our experiments pave a new pathway for vehicle manufacturers and car makers, allowing for a feasible and comprehensive fault detection system and framework. By combining state-of-the-art KGE models and a first-hand case study involving an electric vehicle knowledge graph dataset (EV-KG), this work solidifies future KG-related fault detection research in the field and opens numerous opportunities for further development and application in the real-world industry.Link prediction in knowledge graphs has accelerated in recent years through various state-of-the-art research works and publications, especially KGE-based methods like RotatE (Bollacker et al. 2008), which allows for more accurate and efficient prediction of missing connections (or edges) between entities (or nodes) in a graph. Specifically, the integration of link prediction and KGs enables the ability for data to be analyzed not simply as individual components or entities but as an interconnected system made up of various components, such as those in an electric vehicle.In our method, we first embark on building the EV-KG dataset and develop the components of all physical connections and relations drawn from domain experts and manufacturer documentation. A dictionary file is built for each component and its defined relations with other components. Next, an RDF file format is generated for testing and validation. This is built using the (head, relation, tail) relationship such as (battery_positive_connection, terms-nal_of, battery_cell). We take the KG dataset and pre-process the data such that the data is randomized and split into three distinct sets: a training dataset, a validation dataset, and a testing dataset. Each dataset is analyzed individually by the KGE model for various phases such as training (training dataset) and evaluation (validation and testing datasets). A score function is used to give a score for each of the candidate triples. The score represents the distance between the two nodes, thus, similar to a ranking metric, the lower the score, the better. The experiments were conducted on a high-performance cluster where we have compiled and built a KG specifically based on an electric vehicle’s layout, factoring in various parts that can be faulty, and through that dataset, we will perform our experiment. The dataset we built contains 1378 nodes, 2200 edges, and 15 unique relations. The model was tested using RotatE, HRotatE, pRotatE, DistMult, ComplEx, and TransE modes (Sun et al, 2019), where we found RotatE achieved the best overall score of 0.922 Hits@100.By studying the feasibility of implementing a KG-based fault detection system, this work heavily emphasizes the need for making a more efficient solution for detecting faults and defects in an automotive manufacturing environment. Just as important is the accuracy of this KG-based fault detection system since it will also affect potential losses if, for example, there are too many false negatives or false positives. One of the goals of this work is to also make sure automakers have a choice when it comes to fault detection systems that provide early detection of defects before it becomes in the hands of the consumer.","PeriodicalId":302103,"journal":{"name":"The International FLAIRS Conference Proceedings","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The International FLAIRS Conference Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32473/flairs.36.133373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Automotive manufacturers are under stressful timelines as they shift their focus from internal combustion engines (ICE) to electric (EV) and hybrid-electric vehicles (HEV). The demand for this rapid change is crucial to meet a growing consumer market. New manufacturing challenges coupled with rapid change can lead to substantial safety risks for consumers as well as financial liability for automakers, especially when recalls happen. The resulting misplacement, misalignment, or defective assembly of any of the components or connectors can result in critical or even fatal outcomes for consumers. Recent findings reported by CNBC revealed that the shift to electric vehicles had cost automakers billions of dollars (Kolodny 2022). The cost of recalling an EV far outweighs that of an ICE. For instance, the Ford Kuga plug-in HEV had re-calls costs of about $19,000 per vehicle, in contrast to a typical ICE vehicle recall that averages around $500 per vehicle (Isidore and Vales-Dapena 2022). Furthermore, the EV recall rate has been higher. For instance, China’s EV recall rate was approximately 6.9% of its total sales volume (Hao et al. 2021).Automakers are highly motivated to prevent automotive recalls by implementing and employing several preventative measures. IoT sensor-based fault detection systems, as well as those with camera capabilities, have been used to detect defects during production and assembly processes. Industry 4.0 standards (Garofalo et al. 2022) have been adopted, particularly when companies employ an autonomous assembly process.A critical issue in vision or sensor-based fault detection systems is their limitations, where they can only analyze and observe end components without analyzing the relationships and possible underlying connections with other components. For instance, these relationships can reveal whether a given component is missing or is connected correctly to another component. Simply relying on machine vision examining components in isolation, especially in uncontrolled manufacturing environments, becomes difficult and reliable, not to mention the extremely de-manding computational power needed for vision processing.The motivation of this research work is to present an alternative perspective that employs a collective view of components, represented as a networked graph, particularly a knowledge graph (KG) that we hypothesize its ability to be effective in analyzing data in the search for faults.KGs are a collection of real-world fact triplets of the structured form (head, relation, tail) (Hogan et al. 2022). Fundamentally, KGs can be expressed as a graph where nodes represent components or sub-components, and edges indicate a relationship between the two adjacent components. Hence, KG can be used to effectively represent and map interconnected components during and after manufacturing. Researchers have demonstrated the usefulness of Knowledge Graph Embedding (KGE) as a potential solution for automotive fault detection, and they have used it to advance their autonomous driving solutions (Bosch Global 2022).This research aims at building KGs and testing their effectiveness in detecting faults in a custom dataset. We implement a KG Completion (KGC) algorithm and compare different KG Embedding (KGE) models. Furthermore, we measure and compare the Mean Reciprocal Rank (MRR) and Hits@K to evaluate the algorithm based on various KGE approaches and models. Our findings from our experiments pave a new pathway for vehicle manufacturers and car makers, allowing for a feasible and comprehensive fault detection system and framework. By combining state-of-the-art KGE models and a first-hand case study involving an electric vehicle knowledge graph dataset (EV-KG), this work solidifies future KG-related fault detection research in the field and opens numerous opportunities for further development and application in the real-world industry.Link prediction in knowledge graphs has accelerated in recent years through various state-of-the-art research works and publications, especially KGE-based methods like RotatE (Bollacker et al. 2008), which allows for more accurate and efficient prediction of missing connections (or edges) between entities (or nodes) in a graph. Specifically, the integration of link prediction and KGs enables the ability for data to be analyzed not simply as individual components or entities but as an interconnected system made up of various components, such as those in an electric vehicle.In our method, we first embark on building the EV-KG dataset and develop the components of all physical connections and relations drawn from domain experts and manufacturer documentation. A dictionary file is built for each component and its defined relations with other components. Next, an RDF file format is generated for testing and validation. This is built using the (head, relation, tail) relationship such as (battery_positive_connection, terms-nal_of, battery_cell). We take the KG dataset and pre-process the data such that the data is randomized and split into three distinct sets: a training dataset, a validation dataset, and a testing dataset. Each dataset is analyzed individually by the KGE model for various phases such as training (training dataset) and evaluation (validation and testing datasets). A score function is used to give a score for each of the candidate triples. The score represents the distance between the two nodes, thus, similar to a ranking metric, the lower the score, the better. The experiments were conducted on a high-performance cluster where we have compiled and built a KG specifically based on an electric vehicle’s layout, factoring in various parts that can be faulty, and through that dataset, we will perform our experiment. The dataset we built contains 1378 nodes, 2200 edges, and 15 unique relations. The model was tested using RotatE, HRotatE, pRotatE, DistMult, ComplEx, and TransE modes (Sun et al, 2019), where we found RotatE achieved the best overall score of 0.922 Hits@100.By studying the feasibility of implementing a KG-based fault detection system, this work heavily emphasizes the need for making a more efficient solution for detecting faults and defects in an automotive manufacturing environment. Just as important is the accuracy of this KG-based fault detection system since it will also affect potential losses if, for example, there are too many false negatives or false positives. One of the goals of this work is to also make sure automakers have a choice when it comes to fault detection systems that provide early detection of defects before it becomes in the hands of the consumer.
随着汽车制造商将重点从内燃机(ICE)转向电动汽车(EV)和混合动力汽车(HEV),他们面临着巨大的时间压力。对这种快速变化的需求对于满足不断增长的消费市场至关重要。新的制造挑战加上快速变化可能会给消费者带来巨大的安全风险,也会给汽车制造商带来财务责任,尤其是在发生召回的情况下。由此产生的任何组件或连接器的错位,错位或有缺陷的组装都可能导致严重甚至致命的后果。美国全国广播公司财经频道最近的调查结果显示,向电动汽车的转变使汽车制造商损失了数十亿美元(Kolodny 2022)。召回一辆电动汽车的成本远远超过召回一辆内燃机的成本。例如,福特Kuga插电式混合动力汽车的召回成本约为每辆19,000美元,而典型的ICE汽车召回成本平均约为每辆500美元(Isidore and Vales-Dapena 2022)。此外,电动汽车召回率也更高。例如,中国的电动汽车召回率约为其总销量的6.9% (Hao et al. 2021)。汽车制造商非常积极地通过实施和采用几种预防措施来防止汽车召回。基于物联网传感器的故障检测系统以及具有摄像头功能的故障检测系统已被用于检测生产和装配过程中的缺陷。工业4.0标准(Garofalo et al. 2022)已被采用,特别是当公司采用自主装配过程时。基于视觉或传感器的故障检测系统的一个关键问题是它们的局限性,它们只能分析和观察终端组件,而不能分析与其他组件的关系和可能的潜在连接。例如,这些关系可以显示给定组件是否缺失或是否正确连接到另一个组件。简单地依靠机器视觉孤立地检查组件,特别是在不受控制的制造环境中,变得困难和可靠,更不用说视觉处理所需的极其苛刻的计算能力。这项研究工作的动机是提出另一种视角,采用组件的集体视图,表示为网络图,特别是知识图(KG),我们假设它能够有效地分析数据以寻找故障。kg是结构化形式(头、关系、尾)的现实世界事实三元组的集合(Hogan et al. 2022)。基本上,KGs可以表示为一个图,其中节点表示组件或子组件,边表示两个相邻组件之间的关系。因此,KG可用于在制造期间和之后有效地表示和映射相互连接的组件。研究人员已经证明了知识图谱嵌入(KGE)作为汽车故障检测的潜在解决方案的实用性,并将其用于推进自动驾驶解决方案(Bosch Global 2022)。本研究旨在构建KGs并测试其在自定义数据集中检测故障的有效性。我们实现了一种KG补全(KGC)算法,并比较了不同的KG嵌入(KGE)模型。此外,我们测量并比较了平均倒数秩(MRR)和Hits@K,以评估基于各种KGE方法和模型的算法。我们的实验结果为汽车制造商和汽车制造商铺平了一条新的道路,允许一个可行的和全面的故障检测系统和框架。通过结合最先进的KGE模型和涉及电动汽车知识图谱数据集(EV-KG)的第一手案例研究,这项工作巩固了未来与kg相关的故障检测领域的研究,并为进一步发展和应用于现实世界的行业提供了许多机会。近年来,通过各种最先进的研究工作和出版物,知识图中的链接预测得到了加速,特别是基于kge的方法,如RotatE (Bollacker等人,2008),它可以更准确、更有效地预测图中实体(或节点)之间缺失的连接(或边)。具体来说,链路预测和KGs的集成使数据不仅可以作为单个组件或实体进行分析,还可以作为由各种组件组成的相互连接的系统进行分析,例如电动汽车中的组件。在我们的方法中,我们首先着手构建EV-KG数据集,并开发从领域专家和制造商文档中提取的所有物理连接和关系的组件。为每个组件及其与其他组件的定义关系构建字典文件。接下来,生成用于测试和验证的RDF文件格式。这是使用(头、关系、尾)关系构建的,例如(battery_positive_connection、terms-nal_of、battery_cell)。