Normal view MARC view ISBD view

Reinforcement learning and stochastic optimization : a unified framework for sequential decisions

By: Powell, Warren B.

Publisher: Hoboken John Wiley 2022Description: xxxiv, 1099p.ISBN: 9781119815037.Subject(s): Reinforcement learning | Decision making -- Statistical methods | Mathematical optimization | Stochastic analysisDDC classification: 006.3 | P871r Summary: "The first step in sequential decision problems is to understand what decisions are being made. It is surprising how often it is that people faced with complex problems, which spans scientists in a lab to people trying to solve major health problems, are not able to identify the decisions they face. We then want to find a method for making decisions. There are at least 45 words in the English language that are equivalent to "method for making a decision," but the one we have settled on is policy. The term policy is very familiar to fields such as Markov decision processes and reinforcement learning, but with a much narrower interpretation than we will use. Other fields do not use the term at all. Designing effective policies will be the focus of most of this book. Even more subtle is identifying the different sources of uncertainty. It can be hard enough trying to identify potential decisions, but thinking about all the random events that might affect whatever it is that you are managing, whether it is reducing disease, managing inventories, or making investments, can seem like a hopeless challenge"

List(s) this item appears in: New arrivals February 10 to 16, 2025

average rating: 0.0 (0 votes)

Holdings ( 1 )
Title notes
Comments ( 0 )

Item type	Current location	Collection	Call number	Status	Date due	Barcode	Item holds
Books	PK Kelkar Library, IIT Kanpur	On Display	006.3 P871r (Browse shelf)	Available		A186802

Total holds: 0

Browsing PK Kelkar Library, IIT Kanpur Shelves , Collection code: On Display Close shelf browser

					Next
	006.3 P871r Reinforcement learning and stochastic optimization	515.353 H119v Variational convergence and stochastic homogenization of nonlinear reaction-diffusion problems	515.353 M42 Mathematical theory of evolutionary fluid-flow structure interactions	515.353 Sh45p Periodic homogenization of elliptic systems	Next

"The first step in sequential decision problems is to understand what decisions are being made. It is surprising how often it is that people faced with complex problems, which spans scientists in a lab to people trying to solve major health problems, are not able to identify the decisions they face. We then want to find a method for making decisions. There are at least 45 words in the English language that are equivalent to "method for making a decision," but the one we have settled on is policy. The term policy is very familiar to fields such as Markov decision processes and reinforcement learning, but with a much narrower interpretation than we will use. Other fields do not use the term at all. Designing effective policies will be the focus of most of this book. Even more subtle is identifying the different sources of uncertainty. It can be hard enough trying to identify potential decisions, but thinking about all the random events that might affect whatever it is that you are managing, whether it is reducing disease, managing inventories, or making investments, can seem like a hopeless challenge"

There are no comments for this item.

PKK Library

Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Reinforcement learning and stochastic optimization : a unified framework for sequential decisions

By: Powell, Warren B.

Browsing PK Kelkar Library, IIT Kanpur Shelves , Collection code: On Display Close shelf browser