A Semi-supervised Modeling Method for Dioxin Emission Prediction Based on Random Forest

Wen Xu, Jian Tang, Heng Xia, Jian Zhang, Wen Yu, J. Qiao
{"title":"A Semi-supervised Modeling Method for Dioxin Emission Prediction Based on Random Forest","authors":"Wen Xu, Jian Tang, Heng Xia, Jian Zhang, Wen Yu, J. Qiao","doi":"10.1109/acait53529.2021.9731260","DOIUrl":null,"url":null,"abstract":"Dioxin (DXN) is a kind of pollutants with cumulative effects in the municipal solid waste incineration (MSWI) process. Its emission concentration is difficult to detect online and in real-time, which restricts the operational optimization of the MSWI process. At the same time, it is difficult to meet actual needs through traditional supervised modeling methods because of the high time and economic cost of directly measuring DXN. Therefore, a DXN emission prediction model based on semi-supervised random forest (SSRF) is established to make full use of the unlabeled data obtained in the actual industrial process. First, the training subsets are acquired through randomly sampling the labeled data. Second, the training subsets are utilized to build multiple random forest (RF) models and pseudo-label the unlabeled data. Finally, the mixed samples composed of pseudo-labeled data and labeled data are used to train an RF model for predicting the DXN emission concentration. The proposed method is verified by the actual DXN dataset.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acait53529.2021.9731260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Dioxin (DXN) is a kind of pollutants with cumulative effects in the municipal solid waste incineration (MSWI) process. Its emission concentration is difficult to detect online and in real-time, which restricts the operational optimization of the MSWI process. At the same time, it is difficult to meet actual needs through traditional supervised modeling methods because of the high time and economic cost of directly measuring DXN. Therefore, a DXN emission prediction model based on semi-supervised random forest (SSRF) is established to make full use of the unlabeled data obtained in the actual industrial process. First, the training subsets are acquired through randomly sampling the labeled data. Second, the training subsets are utilized to build multiple random forest (RF) models and pseudo-label the unlabeled data. Finally, the mixed samples composed of pseudo-labeled data and labeled data are used to train an RF model for predicting the DXN emission concentration. The proposed method is verified by the actual DXN dataset.
基于随机森林的二恶英排放预测半监督建模方法
二恶英(DXN)是城市生活垃圾焚烧过程中具有累积效应的污染物。其排放浓度难以在线实时检测,制约了城市生活污水处理过程的运行优化。同时,由于直接测量DXN的时间和经济成本高,传统的监督建模方法难以满足实际需要。因此,为了充分利用实际工业过程中获得的未标记数据,建立了基于半监督随机森林(SSRF)的DXN排放预测模型。首先,对标记数据进行随机采样,得到训练子集。其次,利用训练子集建立多个随机森林模型,对未标记的数据进行伪标记;最后,利用伪标记数据和标记数据组成的混合样本来训练预测DXN发射浓度的射频模型。通过实际的DXN数据集验证了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信