Statistical Inference and Analysis for Efficient Modeling of Environmental Pollution using Deep Neural Networks

2022 Eighth Indian Control Conference (ICC) Pub Date : 2022-12-14 DOI:10.1109/ICC56513.2022.10093411

Chilukuri Lakshmi Sravani, S. Miriyala, K. Mitra

{"title":"Statistical Inference and Analysis for Efficient Modeling of Environmental Pollution using Deep Neural Networks","authors":"Chilukuri Lakshmi Sravani, S. Miriyala, K. Mitra","doi":"10.1109/ICC56513.2022.10093411","DOIUrl":null,"url":null,"abstract":"Rapid development, due to industrialization and urbanization happening worldwide, has become a prominent cause of air pollution. In such a situation, it is important to create an air quality prediction model development methodology, which not only models the data but also provides inferences understandable to the policymakers. Therefore, in this research, a new methodology has been proposed, where the prediction model is created by combining the concepts of Statistical Inferencing and Deep Learning [Gated Recurring Units (GRU)]. Hourly air pollutants concentration and meteorological data with 14 features measured over one year from 25 different monitoring stations in Northern Taiwan are considered as the dataset. Using methodologies such as Analysis of Variance, Tukey Honestly Significant Difference, Graph theory, and Chi-Square analysis, the voluminous dataset is first clustered based on geographical correlations, and for each cluster, the most significant features responsible for modulating Particulate Matter (PM10) concentrations are identified. Subsequently, the new datasets obtained through the statistical study are used to train the GRU model for final predictions. The proposed model has exhibited an overall accuracy between 90.4% to 99.2% for all clusters. The generic nature of the proposed methodology allows for its extension to predict the transient behaviour of other pollutants across different geographical locations.","PeriodicalId":101654,"journal":{"name":"2022 Eighth Indian Control Conference (ICC)","volume":"57 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Eighth Indian Control Conference (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC56513.2022.10093411","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Rapid development, due to industrialization and urbanization happening worldwide, has become a prominent cause of air pollution. In such a situation, it is important to create an air quality prediction model development methodology, which not only models the data but also provides inferences understandable to the policymakers. Therefore, in this research, a new methodology has been proposed, where the prediction model is created by combining the concepts of Statistical Inferencing and Deep Learning [Gated Recurring Units (GRU)]. Hourly air pollutants concentration and meteorological data with 14 features measured over one year from 25 different monitoring stations in Northern Taiwan are considered as the dataset. Using methodologies such as Analysis of Variance, Tukey Honestly Significant Difference, Graph theory, and Chi-Square analysis, the voluminous dataset is first clustered based on geographical correlations, and for each cluster, the most significant features responsible for modulating Particulate Matter (PM10) concentrations are identified. Subsequently, the new datasets obtained through the statistical study are used to train the GRU model for final predictions. The proposed model has exhibited an overall accuracy between 90.4% to 99.2% for all clusters. The generic nature of the proposed methodology allows for its extension to predict the transient behaviour of other pollutants across different geographical locations.

查看原文本刊更多论文

基于深度神经网络的环境污染有效建模的统计推断与分析

由于工业化和城市化在世界范围内的快速发展，已经成为空气污染的一个突出原因。在这种情况下，创建空气质量预测模型开发方法非常重要，该方法不仅可以模拟数据，还可以为决策者提供可理解的推论。因此，在本研究中，提出了一种新的方法，其中通过结合统计推理和深度学习[门控循环单元(GRU)]的概念创建预测模型。每小时的空气污染物浓度和台湾北部25个不同监测站在一年内测量的14个特征的气象数据作为数据集。使用方差分析、显著差异分析、图论和卡方分析等方法，首先根据地理相关性对大量数据集进行聚类，对于每个聚类，确定了负责调节颗粒物(PM10)浓度的最重要特征。随后，通过统计研究获得的新数据集用于训练GRU模型进行最终预测。该模型对所有聚类的总体准确率在90.4%到99.2%之间。所建议方法的一般性质允许将其扩展到预测其他污染物在不同地理位置的瞬态行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 Eighth Indian Control Conference (ICC)

自引率

0.00%

发文量