建筑管理系统中文本分类的机器学习

IF 4.3 3区 工程技术 Q1 ENGINEERING, CIVIL
J. J. Mesa-Jiménez, L. Stokes, QingPing Yang, V. Livina
{"title":"建筑管理系统中文本分类的机器学习","authors":"J. J. Mesa-Jiménez, L. Stokes, QingPing Yang, V. Livina","doi":"10.3846/jcem.2022.16012","DOIUrl":null,"url":null,"abstract":"In building management systems (BMS), a medium building may have between 200 and 1000 sensor points. Their labels need to be translated into a naming standard so they can be automatically recognised by the BMS platform. The current industrial practices often manually translate these points into labels (this is known as the tagging process), which takes around 8 hours for every 100 points. We introduce an AI-based multi-stage text classification that translates BMS points into formatted BMS labels. After comparing five different techniques for text classification (logistic regression, random forests, XGBoost, multinomial Naive Bayes and linear support vector classification), we demonstrate that XGBoost is the top performer with 90.29% of true positives, and use the prediction confidence to filter out false positives. This approach can be applied in sensors networks in various applications, where manual free-text data pre-processing remains cumbersome.","PeriodicalId":15524,"journal":{"name":"Journal of Civil Engineering and Management","volume":" ","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2022-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"MACHINE LEARNING FOR TEXT CLASSIFICATION IN BUILDING MANAGEMENT SYSTEMS\",\"authors\":\"J. J. Mesa-Jiménez, L. Stokes, QingPing Yang, V. Livina\",\"doi\":\"10.3846/jcem.2022.16012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In building management systems (BMS), a medium building may have between 200 and 1000 sensor points. Their labels need to be translated into a naming standard so they can be automatically recognised by the BMS platform. The current industrial practices often manually translate these points into labels (this is known as the tagging process), which takes around 8 hours for every 100 points. We introduce an AI-based multi-stage text classification that translates BMS points into formatted BMS labels. After comparing five different techniques for text classification (logistic regression, random forests, XGBoost, multinomial Naive Bayes and linear support vector classification), we demonstrate that XGBoost is the top performer with 90.29% of true positives, and use the prediction confidence to filter out false positives. This approach can be applied in sensors networks in various applications, where manual free-text data pre-processing remains cumbersome.\",\"PeriodicalId\":15524,\"journal\":{\"name\":\"Journal of Civil Engineering and Management\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2022-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Civil Engineering and Management\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.3846/jcem.2022.16012\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Civil Engineering and Management","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3846/jcem.2022.16012","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 1

摘要

在建筑物管理系统(BMS)中,中型建筑物可以具有200到1000个传感器点。它们的标签需要翻译成命名标准,以便BMS平台自动识别。目前的工业实践通常手动将这些点转换为标签(这被称为标记过程),每100个点大约需要8个小时。我们介绍了一种基于人工智能的多阶段文本分类,将BMS点转换为格式化的BMS标签。在比较了五种不同的文本分类技术(逻辑回归、随机森林、XGBoost、多项式朴素贝叶斯和线性支持向量分类)后,我们证明XGBoost是表现最好的,其真阳性率为90.29%,并使用预测置信度来过滤假阳性。这种方法可以应用于各种应用中的传感器网络,其中手动自由文本数据预处理仍然很麻烦。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MACHINE LEARNING FOR TEXT CLASSIFICATION IN BUILDING MANAGEMENT SYSTEMS
In building management systems (BMS), a medium building may have between 200 and 1000 sensor points. Their labels need to be translated into a naming standard so they can be automatically recognised by the BMS platform. The current industrial practices often manually translate these points into labels (this is known as the tagging process), which takes around 8 hours for every 100 points. We introduce an AI-based multi-stage text classification that translates BMS points into formatted BMS labels. After comparing five different techniques for text classification (logistic regression, random forests, XGBoost, multinomial Naive Bayes and linear support vector classification), we demonstrate that XGBoost is the top performer with 90.29% of true positives, and use the prediction confidence to filter out false positives. This approach can be applied in sensors networks in various applications, where manual free-text data pre-processing remains cumbersome.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.70
自引率
4.70%
发文量
0
审稿时长
1.7 months
期刊介绍: The Journal of Civil Engineering and Management is a peer-reviewed journal that provides an international forum for the dissemination of the latest original research, achievements and developments. We publish for researchers, designers, users and manufacturers in the different fields of civil engineering and management. The journal publishes original articles that present new information and reviews. Our objective is to provide essential information and new ideas to help improve civil engineering competency, efficiency and productivity in world markets. The Journal of Civil Engineering and Management publishes articles in the following fields: building materials and structures, structural mechanics and physics, geotechnical engineering, road and bridge engineering, urban engineering and economy, constructions technology, economy and management, information technologies in construction, fire protection, thermoinsulation and renovation of buildings, labour safety in construction.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信