Convergence Problem of a Sequence of First Passage Markov Decision Processes with Varying Discount Factors
-
Graphical Abstract
-
Abstract
In this paper, we study the convergence problem of a sequence of first passage Markov decision processes with constraints and varying discount factors. Using the ``occupation measures'' and its related properties, we transform the constrained optimality problems into linear programming problems on the set of occupation measures (i.e., the convex analytic approach), and then prove that the optimal values and optimal policies of the original first passage Markov decision processes converge respectively to those of the ``limit'' one.
-
-