{"title":"Multi-view Deep Embedded Clustering: Exploring a new dimension of air pollution","authors":"Hassan Kassem , Sally El Hajjar , Fahed Abdallah , Hichem Omrani","doi":"10.1016/j.engappai.2024.109509","DOIUrl":null,"url":null,"abstract":"<div><div>Clustering is essential for uncovering hidden patterns and relationships in complex datasets. Its importance reveals when labeled data is scarce, expensive, time-consuming to obtain. Real-world applications often exhibit heterogeneity due to the diverse nature of the encapsulated data. This heterogeneity poses a significant challenge in data analysis, modeling, and makes traditional clustering methods ineffective. By adopting a hybrid architecture based on two promising techniques, multi-view and deep clustering, our method achieved better results, outperforming several existing methods including <em>K</em>-means, deep embedded clustering, deep clustering network, deep embedded <em>K</em>-means among many others. Multiple experiments conducted across diverse publicly accessible datasets validate the effectiveness of our proposed method based on well established evaluation metrics such as Accuracy and Normalized Mutual Information (NMI). Furthermore, we applied our method on the air pollution data of Luxembourg, a country with sparse sensor coverage. Our method demonstrated promising results, and unveil a new dimension that pave way for future work in air pollution’s level prediction and hotspots detection, crucial steps towards effective pollution reduction strategies.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016671","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Clustering is essential for uncovering hidden patterns and relationships in complex datasets. Its importance reveals when labeled data is scarce, expensive, time-consuming to obtain. Real-world applications often exhibit heterogeneity due to the diverse nature of the encapsulated data. This heterogeneity poses a significant challenge in data analysis, modeling, and makes traditional clustering methods ineffective. By adopting a hybrid architecture based on two promising techniques, multi-view and deep clustering, our method achieved better results, outperforming several existing methods including K-means, deep embedded clustering, deep clustering network, deep embedded K-means among many others. Multiple experiments conducted across diverse publicly accessible datasets validate the effectiveness of our proposed method based on well established evaluation metrics such as Accuracy and Normalized Mutual Information (NMI). Furthermore, we applied our method on the air pollution data of Luxembourg, a country with sparse sensor coverage. Our method demonstrated promising results, and unveil a new dimension that pave way for future work in air pollution’s level prediction and hotspots detection, crucial steps towards effective pollution reduction strategies.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.