The semantic correlation mining method of multimodal data in constructing techno-economic knowledge graph of power grid

IF 4.3

Intelligent Systems with Applications Pub Date : 2025-09-21 DOI:10.1016/j.iswa.2025.200588

Ling Qiu, Mengqi Pan, Nuoya Lv

{"title":"The semantic correlation mining method of multimodal data in constructing techno-economic knowledge graph of power grid","authors":"Ling Qiu, Mengqi Pan, Nuoya Lv","doi":"10.1016/j.iswa.2025.200588","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the diverse formats and complex structures of multimodal data, effectively managing its complexity and correlations remains challenging. Moreover, when dealing with large-scale data, traditional methods often encounter issues such as low computational efficiency and inaccurate results. This paper proposes a semantic association mining method for multimodal data. This method utilizes ETL technology to convert text and table data from different files into nodes and relational edges in the knowledge graph. By optimizing the word vector matrix through the skip character model, it can better capture the semantic information of text data and accurately reflect semantic similarity. Through integrating nodes such as equipment, design technologies and installation addresses, a technical and economic knowledge graph of the power grid is constructed. For the calculation of multimodal object associations, the data first undergoes label preprocessing, feature processing, and semantic relationship structuring before the association is computed using the cosine similarity formula. By using the association rule algorithm to mine the correlation relationships among time-series variables, potential correlations such as the operating status of equipment and the overall performance of the power grid can be discovered, thereby improving the understanding and prediction ability of the power grid’s operating status. The experimental results demonstrate that the proposed method achieves the highest accuracy and recall rate at 98.20 %, with an F-measure of 93.89 %, a bit error rate below 0.9, and a time consumption of approximately 7.34 s.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200588"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305325001140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Due to the diverse formats and complex structures of multimodal data, effectively managing its complexity and correlations remains challenging. Moreover, when dealing with large-scale data, traditional methods often encounter issues such as low computational efficiency and inaccurate results. This paper proposes a semantic association mining method for multimodal data. This method utilizes ETL technology to convert text and table data from different files into nodes and relational edges in the knowledge graph. By optimizing the word vector matrix through the skip character model, it can better capture the semantic information of text data and accurately reflect semantic similarity. Through integrating nodes such as equipment, design technologies and installation addresses, a technical and economic knowledge graph of the power grid is constructed. For the calculation of multimodal object associations, the data first undergoes label preprocessing, feature processing, and semantic relationship structuring before the association is computed using the cosine similarity formula. By using the association rule algorithm to mine the correlation relationships among time-series variables, potential correlations such as the operating status of equipment and the overall performance of the power grid can be discovered, thereby improving the understanding and prediction ability of the power grid’s operating status. The experimental results demonstrate that the proposed method achieves the highest accuracy and recall rate at 98.20 %, with an F-measure of 93.89 %, a bit error rate below 0.9, and a time consumption of approximately 7.34 s.

查看原文本刊更多论文

构建电网技术经济知识图谱中多模态数据的语义关联挖掘方法

由于多模态数据的多种格式和复杂结构，有效管理其复杂性和相关性仍然具有挑战性。此外，在处理大规模数据时，传统方法往往会遇到计算效率低、结果不准确等问题。提出了一种多模态数据的语义关联挖掘方法。该方法利用ETL技术将不同文件中的文本和表格数据转换为知识图中的节点和关系边。通过跳过字符模型对词向量矩阵进行优化，可以更好地捕捉文本数据的语义信息，准确反映语义相似度。通过对设备、设计技术、安装地址等节点的整合，构建了电网的技术经济知识图谱。在计算多模态对象关联时，首先对数据进行标签预处理、特征处理和语义关系构建，然后使用余弦相似度公式计算关联。利用关联规则算法挖掘时间序列变量之间的相关关系，可以发现设备运行状态与电网整体性能等潜在的相关性，从而提高对电网运行状态的理解和预测能力。实验结果表明，该方法达到了98.20%的最高准确率和召回率，f值为93.89%，误码率低于0.9，时间消耗约为7.34 s。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Intelligent Systems with Applications

CiteScore

5.60

自引率

0.00%

发文量