Predicting Technical Problems of Hydropower Engineering Using eXtreme Gradient Boosting

Jing Zhu, Yi Chen, Limin Huang, Chunyong She, Yangfeng Wu, Wenyu Zhang
{"title":"Predicting Technical Problems of Hydropower Engineering Using eXtreme Gradient Boosting","authors":"Jing Zhu, Yi Chen, Limin Huang, Chunyong She, Yangfeng Wu, Wenyu Zhang","doi":"10.11648/J.SJAMS.20180604.13","DOIUrl":null,"url":null,"abstract":"Nowadays, water shortage is increasingly severe, which has huge negative influence on daily life. Constructing hydropower engineering is one of the approaches to alleviate such problem. Therefore, it’s worth settling technical problems of hydropower engineering timely, which will help people not only make better use of water resources but also get rid of various security risks. To achieve such goal, this study predicts potential technical problems that hydropower engineering might happen. In order to utilize the large amount of data, data mining techniques are used to solve this multi-classification problem. First of all, plenty of data is preprocessed. Particularly, because of the complexity of text data, text mining techniques are applied to transform the unstructured data to structural data. Then, eXtreme Gradient Boosting (XGBoost) is applied to make the classification. To validate efficiency of the model, comparisons are made among XGBoost, Gradient Boosting Decision Tree, Random Forest, Decision Tree, k-Nearest Neighbor and Bernoulli Naive Bayes from the perspective of accuracy, precision, recall and f-score. The experimental result shows that XGBoost is more suitable to solve this classification problem. This study provides engineering inspectors with helpful suggestions of particular technical problems that need attention, and further enables people to inspect engineering more efficiently and effectively.","PeriodicalId":422938,"journal":{"name":"Science Journal of Applied Mathematics and Statistics","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Journal of Applied Mathematics and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11648/J.SJAMS.20180604.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Nowadays, water shortage is increasingly severe, which has huge negative influence on daily life. Constructing hydropower engineering is one of the approaches to alleviate such problem. Therefore, it’s worth settling technical problems of hydropower engineering timely, which will help people not only make better use of water resources but also get rid of various security risks. To achieve such goal, this study predicts potential technical problems that hydropower engineering might happen. In order to utilize the large amount of data, data mining techniques are used to solve this multi-classification problem. First of all, plenty of data is preprocessed. Particularly, because of the complexity of text data, text mining techniques are applied to transform the unstructured data to structural data. Then, eXtreme Gradient Boosting (XGBoost) is applied to make the classification. To validate efficiency of the model, comparisons are made among XGBoost, Gradient Boosting Decision Tree, Random Forest, Decision Tree, k-Nearest Neighbor and Bernoulli Naive Bayes from the perspective of accuracy, precision, recall and f-score. The experimental result shows that XGBoost is more suitable to solve this classification problem. This study provides engineering inspectors with helpful suggestions of particular technical problems that need attention, and further enables people to inspect engineering more efficiently and effectively.
利用极值梯度助推预测水电工程技术问题
如今,水资源短缺日益严重,这对日常生活产生了巨大的负面影响。建设水电工程是缓解这一问题的途径之一。因此,及时解决水电工程的技术问题是值得的,这不仅有助于人们更好地利用水资源,也有助于人们摆脱各种安全风险。为了实现这一目标,本研究对水电工程可能发生的潜在技术问题进行了预测。为了利用海量数据,采用数据挖掘技术来解决这种多分类问题。首先,需要对大量数据进行预处理。特别是,由于文本数据的复杂性,文本挖掘技术被用于将非结构化数据转换为结构化数据。然后,应用极限梯度增强(XGBoost)进行分类。为了验证模型的有效性,从正确率、精密度、召回率和f-score的角度对XGBoost、梯度增强决策树、随机森林、决策树、k近邻和伯努利朴素贝叶斯进行了比较。实验结果表明,XGBoost更适合解决这一分类问题。本研究为工程检查员提供了具体技术问题需要注意的有益建议,进一步使人们能够更高效、更有效地进行工程检查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信