谷歌浏览器插件
订阅小程序
在清言上使用

Dynamic Programming For Data Independent Decision Sets

JOURNAL OF CONVEX ANALYSIS(2023)

引用 0|浏览1
暂无评分
摘要
Multistage stochastic optimization problems are oftentimes formulated informally in a pathwise way. These formulations are appealing in a discrete setting and suitable when addressing computational challenges, for example. But the pathwise problem statement does not allow an analysis with mathematical rigor and is therefore not appropriate. R. T. Rockafellar and R. J.-B. Wets [Nonanticipativity and L-1-martingales in stochastic optimization problems, Math. Programming Study 6 (1976) 170-187] address the fundamental measurability concern of the value functions in the case of convex costs and constraints. This paper resumes these foundations. The contribution is a proof that there exist measurable versions of intermediate value functions, which reveals regularity in addition. Our proof builds on the Kolmogorov continuity theorem. It is demonstrated that verification theorems allow stating traditional problem specifications in the novel setting with mathematical rigor. Further, we provide dynamic equations for the general problem. The problem classes covered include Markov decision processes, reinforcement learning and stochastic dual dynamic programming.
更多
查看译文
关键词
Multistage stochastic optimization,stochastic processes,measurability
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要