Convergence Problem of a Sequence of First Passage Markov Decision Processes with Varying Discount Factors

WU Xiao; GUO Zhenbin

doi:10.3969/j.issn.1001-4268.2021.06.004

WU Xiao, GUO Zhenbin. Convergence Problem of a Sequence of First Passage Markov Decision Processes with Varying Discount Factors[J]. Chinese Journal of Applied Probability and Statistics, 2021, 37(6): 598-610.

Abstract

In this paper, we study the convergence problem of a sequence of first passage Markov decision processes with constraints and varying discount factors. Using "occupation measures" and their related properties, we transform the constrained optimality problems into linear programming problems on the set of occupation measures (i.e., the convex analytic approach), and then prove that the optimal values and optimal policies of the original first passage Markov decision processes converge, respectively, to those of the "limit" process.
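
As a rough illustration of the convex analytic approach mentioned above, the following sketch writes a constrained first passage problem as a linear program over occupation measures. It rests on simplifying assumptions that are not taken from the paper: finite state space $S$ and action sets $A(x)$, a target set $B \subset S$ with first passage time $\tau_B$, an initial distribution $\gamma$, state-action dependent discount factors $\alpha(x,a) \in (0,1)$ applied as a running product, a cost $c_0$ to minimize, and constraint costs $c_k$ with bounds $d_k$. All of these symbols are hypothetical notation chosen for the sketch, and the paper's actual setting and notation may differ.

```latex
% Assumed occupation measure of a policy \pi on state-action pairs with x \notin B:
\eta^{\pi}(x,a) \;=\; \mathbb{E}^{\pi}_{\gamma}\!\left[\,\sum_{t=0}^{\tau_B-1}
  \Big(\prod_{s=0}^{t-1}\alpha(x_s,a_s)\Big)\,\mathbf{1}\{x_t=x,\;a_t=a\}\right].

% Under these assumptions the expected discounted first passage cost is linear in \eta:
V_k(\pi) \;=\; \sum_{x\notin B}\sum_{a\in A(x)} c_k(x,a)\,\eta^{\pi}(x,a),
\qquad k=0,1,\dots,q.

% Linear program over occupation measures:
\begin{aligned}
\min_{\eta\,\ge\,0}\quad & \sum_{x\notin B}\sum_{a\in A(x)} c_0(x,a)\,\eta(x,a)\\
\text{s.t.}\quad
& \sum_{a\in A(y)}\eta(y,a)
  \;=\; \gamma(y)+\sum_{x\notin B}\sum_{a\in A(x)}\alpha(x,a)\,p(y\mid x,a)\,\eta(x,a),
  && y\notin B,\\
& \sum_{x\notin B}\sum_{a\in A(x)} c_k(x,a)\,\eta(x,a)\;\le\; d_k,
  && k=1,\dots,q.
\end{aligned}
```

In this finite sketch, a feasible $\eta$ induces a stationary randomized policy through $\pi(a\mid x)=\eta(x,a)/\sum_{a'}\eta(x,a')$ wherever the denominator is positive, which is what connects optimal solutions of the linear program to optimal policies.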
