Multi-view Deep Embedded Clustering: Exploring a new dimension of air pollution

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Hassan Kassem , Sally El Hajjar , Fahed Abdallah , Hichem Omrani
{"title":"Multi-view Deep Embedded Clustering: Exploring a new dimension of air pollution","authors":"Hassan Kassem ,&nbsp;Sally El Hajjar ,&nbsp;Fahed Abdallah ,&nbsp;Hichem Omrani","doi":"10.1016/j.engappai.2024.109509","DOIUrl":null,"url":null,"abstract":"<div><div>Clustering is essential for uncovering hidden patterns and relationships in complex datasets. Its importance reveals when labeled data is scarce, expensive, time-consuming to obtain. Real-world applications often exhibit heterogeneity due to the diverse nature of the encapsulated data. This heterogeneity poses a significant challenge in data analysis, modeling, and makes traditional clustering methods ineffective. By adopting a hybrid architecture based on two promising techniques, multi-view and deep clustering, our method achieved better results, outperforming several existing methods including <em>K</em>-means, deep embedded clustering, deep clustering network, deep embedded <em>K</em>-means among many others. Multiple experiments conducted across diverse publicly accessible datasets validate the effectiveness of our proposed method based on well established evaluation metrics such as Accuracy and Normalized Mutual Information (NMI). Furthermore, we applied our method on the air pollution data of Luxembourg, a country with sparse sensor coverage. Our method demonstrated promising results, and unveil a new dimension that pave way for future work in air pollution’s level prediction and hotspots detection, crucial steps towards effective pollution reduction strategies.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016671","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Clustering is essential for uncovering hidden patterns and relationships in complex datasets. Its importance reveals when labeled data is scarce, expensive, time-consuming to obtain. Real-world applications often exhibit heterogeneity due to the diverse nature of the encapsulated data. This heterogeneity poses a significant challenge in data analysis, modeling, and makes traditional clustering methods ineffective. By adopting a hybrid architecture based on two promising techniques, multi-view and deep clustering, our method achieved better results, outperforming several existing methods including K-means, deep embedded clustering, deep clustering network, deep embedded K-means among many others. Multiple experiments conducted across diverse publicly accessible datasets validate the effectiveness of our proposed method based on well established evaluation metrics such as Accuracy and Normalized Mutual Information (NMI). Furthermore, we applied our method on the air pollution data of Luxembourg, a country with sparse sensor coverage. Our method demonstrated promising results, and unveil a new dimension that pave way for future work in air pollution’s level prediction and hotspots detection, crucial steps towards effective pollution reduction strategies.
多视角深度嵌入式聚类:探索空气污染的新维度
聚类对于揭示复杂数据集中隐藏的模式和关系至关重要。当标记数据稀缺、昂贵且获取耗时时,聚类的重要性就显现出来了。由于封装数据的性质各不相同,现实世界中的应用往往表现出异质性。这种异质性给数据分析和建模带来了巨大挑战,并使传统的聚类方法失效。通过采用基于多视图和深度聚类这两种有前途的技术的混合架构,我们的方法取得了更好的效果,优于 K-means、深度嵌入式聚类、深度聚类网络、深度嵌入式 K-means 等多种现有方法。在不同的公开数据集上进行的多项实验验证了我们提出的方法的有效性,这些实验基于成熟的评估指标,如准确率和归一化互信息(NMI)。此外,我们还将我们的方法应用于传感器覆盖范围稀少的卢森堡的空气污染数据。我们的方法取得了可喜的成果,并揭示了一个新的维度,为今后在空气污染水平预测和热点检测方面的工作铺平了道路,而这正是有效减少污染战略的关键步骤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信