Reinforcement Learning and Stochastic Optimization

Warrren B Powell
{"title":"Reinforcement Learning and Stochastic Optimization","authors":"Warrren B Powell","doi":"10.1002/9781119815068","DOIUrl":null,"url":null,"abstract":"REINFORCEMENT LEARNING AND STOCHASTIC OPTIMIZATION Clearing the jungle of stochastic optimization Sequential decision problems, which consist of \"decision, information, decision, information,” are ubiquitous, spanning virtually every human activity ranging from business applications, health (personal and public health, and medical decision making), energy, the sciences, all fields of engineering, finance, and e-commerce. The diversity of applications attracted the attention of at least 15 distinct fields of research, using eight distinct notational systems which produced a vast array of analytical tools. A byproduct is that powerful tools developed in one community may be unknown to other communities.Reinforcement Learning and Stochastic Optimization offers a single canonical framework that can model any sequential decision problem using five core components: state variables, decision variables, exogenous information variables, transition function, and objective function. This book highlights twelve types of uncertainty that might enter any model and pulls together the diverse set of methods for making decisions, known as policies, into four fundamental classes that span every method suggested in the academic literature or used in practice.Reinforcement Learning and Stochastic Optimization is the first book to provide a balanced treatment of the different methods for modeling and solving sequential decision problems, following the style used by most books on machine learning, optimization, and simulation. The presentation is designed for readers with a course in probability and statistics, and an interest in modeling and applications. Linear programming is occasionally used for specific problem classes. The book is designed for readers who are new to the field, as well as those with some background in optimization under uncertainty.Throughout this book, readers will find references to over 100 different applications, spanning pure learning problems, dynamic resource allocation problems, general state-dependent problems, and hybrid learning/resource allocation problems such as those that arose in the COVID pandemic. There are 370 exercises, organized into seven groups, ranging from review questions, modeling, computation, problem solving, theory, programming exercises and a \"diary problem\" that a reader chooses at the beginning of the book, and which is used as a basis for questions throughout the rest of the book. © 2022 John Wiley & Sons, Inc. All rights reserved.","PeriodicalId":334478,"journal":{"name":"Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"41","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/9781119815068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 41

Abstract

REINFORCEMENT LEARNING AND STOCHASTIC OPTIMIZATION Clearing the jungle of stochastic optimization Sequential decision problems, which consist of "decision, information, decision, information,” are ubiquitous, spanning virtually every human activity ranging from business applications, health (personal and public health, and medical decision making), energy, the sciences, all fields of engineering, finance, and e-commerce. The diversity of applications attracted the attention of at least 15 distinct fields of research, using eight distinct notational systems which produced a vast array of analytical tools. A byproduct is that powerful tools developed in one community may be unknown to other communities.Reinforcement Learning and Stochastic Optimization offers a single canonical framework that can model any sequential decision problem using five core components: state variables, decision variables, exogenous information variables, transition function, and objective function. This book highlights twelve types of uncertainty that might enter any model and pulls together the diverse set of methods for making decisions, known as policies, into four fundamental classes that span every method suggested in the academic literature or used in practice.Reinforcement Learning and Stochastic Optimization is the first book to provide a balanced treatment of the different methods for modeling and solving sequential decision problems, following the style used by most books on machine learning, optimization, and simulation. The presentation is designed for readers with a course in probability and statistics, and an interest in modeling and applications. Linear programming is occasionally used for specific problem classes. The book is designed for readers who are new to the field, as well as those with some background in optimization under uncertainty.Throughout this book, readers will find references to over 100 different applications, spanning pure learning problems, dynamic resource allocation problems, general state-dependent problems, and hybrid learning/resource allocation problems such as those that arose in the COVID pandemic. There are 370 exercises, organized into seven groups, ranging from review questions, modeling, computation, problem solving, theory, programming exercises and a "diary problem" that a reader chooses at the beginning of the book, and which is used as a basis for questions throughout the rest of the book. © 2022 John Wiley & Sons, Inc. All rights reserved.
强化学习与随机优化
由“决策、信息、决策、信息”组成的顺序决策问题无处不在,几乎跨越了从商业应用、健康(个人和公共健康,以及医疗决策)、能源、科学、工程、金融和电子商务的所有领域的每一项人类活动。应用的多样性吸引了至少15个不同研究领域的注意,使用了8种不同的符号系统,产生了大量的分析工具。副产品是一个社区开发的强大工具可能不为其他社区所知。强化学习和随机优化提供了一个单一的规范框架,可以使用五个核心组件来建模任何顺序决策问题:状态变量、决策变量、外生信息变量、转移函数和目标函数。这本书强调了可能进入任何模型的12种不确定性类型,并将制定决策的各种方法(称为政策)汇集在一起,分为四个基本类,涵盖了学术文献中建议的每种方法或在实践中使用的方法。《强化学习和随机优化》是第一本平衡处理不同方法的书,用于建模和解决顺序决策问题,遵循大多数关于机器学习、优化和模拟的书籍所使用的风格。该报告是专为读者在概率和统计课程,并在建模和应用的兴趣。线性规划有时用于特定的问题类。这本书是专为读者谁是新的领域,以及那些在不确定性下优化的一些背景。在这本书中,读者将找到100多个不同应用的参考文献,涵盖纯学习问题、动态资源分配问题、一般状态依赖问题以及混合学习/资源分配问题,如COVID大流行中出现的问题。本书共有370个习题,分为七组,包括复习题、建模题、计算题、解题题、理论题、编程题和一个读者在本书开头选择的“日记题”,这是本书其余部分问题的基础。©2022 John Wiley & Sons, Inc版权所有。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信