Application of big data technology in electromechanical operation and maintenance intelligent platform

Paladyn : journal of behavioral robotics Pub Date : 2023-01-01 DOI:10.1515/pjbr-2022-0121

Wenjuan Yang, Zhongbin Chan, Yang Wang, Fuli Qi

{"title":"Application of big data technology in electromechanical operation and maintenance intelligent platform","authors":"Wenjuan Yang, Zhongbin Chan, Yang Wang, Fuli Qi","doi":"10.1515/pjbr-2022-0121","DOIUrl":null,"url":null,"abstract":"Abstract Aiming at the data preprocessing requirements and label data cost issues arising from the intelligent operation and maintenance of electromechanical equipment, this article mainly studies structured data cleaning methods and fault prediction algorithms for a small number of label samples. First, this article introduces the overall architecture of the intelligent operation and maintenance system for electromechanical equipment. Second, based on the electromechanical equipment operation and maintenance data access service, data cleaning, and fault prediction, this article constructs an electromechanical equipment intelligent operation and maintenance platform based on Kafka message queue, Spark cluster, and other components, and introduces the functional composition of the system in detail. Finally, the article describes the functions of each component of data access service, data cleaning, and fault prediction in detail. To address the cost issue associated with sufficient labeled sample data for data analysis, we propose a semi-supervised learning algorithm, IF-GBDT, based on improved independent forests and Gradient Boosting Decision Tree. The independent forest algorithm supplements labels for unlabeled data based on the learning results of a small number of labeled samples. We also use the gradient lifting tree algorithm to train the model based on the new tag data set for fault prediction, thereby reducing the impact of lack of tags on the accuracy of the prediction model. Experiments show that this method improves classification accuracy and has good adaptability and concurrency performance for a small number of tags.","PeriodicalId":90037,"journal":{"name":"Paladyn : journal of behavioral robotics","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Paladyn : journal of behavioral robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/pjbr-2022-0121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Aiming at the data preprocessing requirements and label data cost issues arising from the intelligent operation and maintenance of electromechanical equipment, this article mainly studies structured data cleaning methods and fault prediction algorithms for a small number of label samples. First, this article introduces the overall architecture of the intelligent operation and maintenance system for electromechanical equipment. Second, based on the electromechanical equipment operation and maintenance data access service, data cleaning, and fault prediction, this article constructs an electromechanical equipment intelligent operation and maintenance platform based on Kafka message queue, Spark cluster, and other components, and introduces the functional composition of the system in detail. Finally, the article describes the functions of each component of data access service, data cleaning, and fault prediction in detail. To address the cost issue associated with sufficient labeled sample data for data analysis, we propose a semi-supervised learning algorithm, IF-GBDT, based on improved independent forests and Gradient Boosting Decision Tree. The independent forest algorithm supplements labels for unlabeled data based on the learning results of a small number of labeled samples. We also use the gradient lifting tree algorithm to train the model based on the new tag data set for fault prediction, thereby reducing the impact of lack of tags on the accuracy of the prediction model. Experiments show that this method improves classification accuracy and has good adaptability and concurrency performance for a small number of tags.

查看原文本刊更多论文

大数据技术在机电运维智能平台中的应用

摘要针对机电设备智能运维过程中出现的数据预处理要求和标签数据成本问题，本文主要研究了少量标签样本的结构化数据清洗方法和故障预测算法。本文首先介绍了机电设备智能运维系统的总体架构。其次，基于机电设备运维数据访问服务、数据清洗和故障预测，构建了基于Kafka消息队列、Spark集群等组件的机电设备智能运维平台，并详细介绍了系统的功能组成。最后，详细描述了数据访问服务、数据清理和故障预测等各个组件的功能。为了解决与数据分析所需的足够标记样本数据相关的成本问题，我们提出了一种基于改进独立森林和梯度增强决策树的半监督学习算法IF-GBDT。独立森林算法根据少量标记样本的学习结果，对未标记的数据补充标签。我们还使用梯度提升树算法基于新的标签数据集训练模型进行故障预测，从而减少了缺乏标签对预测模型准确性的影响。实验表明，该方法提高了分类精度，对少量标签具有良好的适应性和并发性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Paladyn : journal of behavioral robotics

自引率

0.00%

发文量