A novel method to predict water quality resilience using deep reinforcement learning in São Paulo, Brazil

Mahmudul Hasan, Ali Mohsin, M. Imani, L. Bittencourt
{"title":"A novel method to predict water quality resilience using deep reinforcement learning in São Paulo, Brazil","authors":"Mahmudul Hasan, Ali Mohsin, M. Imani, L. Bittencourt","doi":"10.1109/ICIET48527.2019.9290601","DOIUrl":null,"url":null,"abstract":"In this paper, a unique method for predicting water quality resilience has been demonstrated using deep reinforcement learning. We showed a step by step procedure to formalise a multi-objective Markov decision process (MOMDP) to make a simulated environment for predicting water quality resilience in the state of São Paulo, Brazil. A common approach to solve a historical dataset is supervised learning. However, if we turn the problem into a multi-objective optimisation problem in a dynamic environment, the traditional approach may not be suitable or can have difficulties to be implemented. Notwithstanding, if we want to address this problem using a simulated environment (e.g. gaming environment) and give an opportunity to the agent to learn from the environment by itself through trial and error method, it could have brought human level expertise in such a complex constrained environment. An intelligent agent traverses this environment to determine the compromising solutions among objectives to predict resilient areas. The outcome revealed that a water quality dataset can be modelled into a reinforcement learning settings that is predominantly used to solve sequential decision making. This innovative method may solve many other intractable engineering problems with the help of deep reinforcement learning.","PeriodicalId":427838,"journal":{"name":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 2nd International Conference on Innovation in Engineering and Technology (ICIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIET48527.2019.9290601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, a unique method for predicting water quality resilience has been demonstrated using deep reinforcement learning. We showed a step by step procedure to formalise a multi-objective Markov decision process (MOMDP) to make a simulated environment for predicting water quality resilience in the state of São Paulo, Brazil. A common approach to solve a historical dataset is supervised learning. However, if we turn the problem into a multi-objective optimisation problem in a dynamic environment, the traditional approach may not be suitable or can have difficulties to be implemented. Notwithstanding, if we want to address this problem using a simulated environment (e.g. gaming environment) and give an opportunity to the agent to learn from the environment by itself through trial and error method, it could have brought human level expertise in such a complex constrained environment. An intelligent agent traverses this environment to determine the compromising solutions among objectives to predict resilient areas. The outcome revealed that a water quality dataset can be modelled into a reinforcement learning settings that is predominantly used to solve sequential decision making. This innovative method may solve many other intractable engineering problems with the help of deep reinforcement learning.
在巴西圣保罗,一种使用深度强化学习预测水质弹性的新方法
在本文中,一种独特的预测水质弹性的方法已经证明使用深度强化学习。我们展示了一步一步的程序来形式化多目标马尔可夫决策过程(MOMDP),以创建一个模拟环境来预测巴西圣保罗州的水质弹性。解决历史数据集的一种常用方法是监督学习。然而,如果我们把问题变成一个动态环境中的多目标优化问题,传统的方法可能不适合或难以实现。尽管如此,如果我们想要使用模拟环境(例如游戏环境)来解决这个问题,并给代理一个通过试错法从环境中学习的机会,它可以在这样一个复杂的约束环境中带来人类水平的专业知识。智能代理遍历该环境,确定目标之间的折衷解决方案,以预测弹性区域。结果表明,水质数据集可以建模为强化学习设置,主要用于解决顺序决策。这种创新的方法可以在深度强化学习的帮助下解决许多其他棘手的工程问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信