A novel method to predict water quality resilience using deep reinforcement learning in São Paulo, Brazil

2019 2nd International Conference on Innovation in Engineering and Technology (ICIET) Pub Date : 2019-12-23 DOI:10.1109/ICIET48527.2019.9290601

Mahmudul Hasan, Ali Mohsin, M. Imani, L. Bittencourt

{"title":"A novel method to predict water quality resilience using deep reinforcement learning in São Paulo, Brazil","authors":"Mahmudul Hasan, Ali Mohsin, M. Imani, L. Bittencourt","doi":"10.1109/ICIET48527.2019.9290601","DOIUrl":null,"url":null,"abstract":"In this paper, a unique method for predicting water quality resilience has been demonstrated using deep reinforcement learning. We showed a step by step procedure to formalise a multi-objective Markov decision process (MOMDP) to make a simulated environment for predicting water quality resilience in the state of São Paulo, Brazil. A common approach to solve a historical dataset is supervised learning. However, if we turn the problem into a multi-objective optimisation problem in a dynamic environment, the traditional approach may not be suitable or can have difficulties to be implemented. Notwithstanding, if we want to address this problem using a simulated environment (e.g. gaming environment) and give an opportunity to the agent to learn from the environment by itself through trial and error method, it could have brought human level expertise in such a complex constrained environment. An intelligent agent traverses this environment to determine the compromising solutions among objectives to predict resilient areas. The outcome revealed that a water quality dataset can be modelled into a reinforcement learning settings that is predominantly used to solve sequential decision making. This innovative method may solve many other intractable engineering problems with the help of deep reinforcement learning.","PeriodicalId":427838,"journal":{"name":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIET48527.2019.9290601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, a unique method for predicting water quality resilience has been demonstrated using deep reinforcement learning. We showed a step by step procedure to formalise a multi-objective Markov decision process (MOMDP) to make a simulated environment for predicting water quality resilience in the state of São Paulo, Brazil. A common approach to solve a historical dataset is supervised learning. However, if we turn the problem into a multi-objective optimisation problem in a dynamic environment, the traditional approach may not be suitable or can have difficulties to be implemented. Notwithstanding, if we want to address this problem using a simulated environment (e.g. gaming environment) and give an opportunity to the agent to learn from the environment by itself through trial and error method, it could have brought human level expertise in such a complex constrained environment. An intelligent agent traverses this environment to determine the compromising solutions among objectives to predict resilient areas. The outcome revealed that a water quality dataset can be modelled into a reinforcement learning settings that is predominantly used to solve sequential decision making. This innovative method may solve many other intractable engineering problems with the help of deep reinforcement learning.

查看原文本刊更多论文

在巴西圣保罗，一种使用深度强化学习预测水质弹性的新方法

在本文中，一种独特的预测水质弹性的方法已经证明使用深度强化学习。我们展示了一步一步的程序来形式化多目标马尔可夫决策过程(MOMDP)，以创建一个模拟环境来预测巴西圣保罗州的水质弹性。解决历史数据集的一种常用方法是监督学习。然而，如果我们把问题变成一个动态环境中的多目标优化问题，传统的方法可能不适合或难以实现。尽管如此，如果我们想要使用模拟环境(例如游戏环境)来解决这个问题，并给代理一个通过试错法从环境中学习的机会，它可以在这样一个复杂的约束环境中带来人类水平的专业知识。智能代理遍历该环境，确定目标之间的折衷解决方案，以预测弹性区域。结果表明，水质数据集可以建模为强化学习设置，主要用于解决顺序决策。这种创新的方法可以在深度强化学习的帮助下解决许多其他棘手的工程问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)

自引率

0.00%

发文量