WEN Xin, XU Xiaoya, GUO Xianping. Nonstationary Markov Decision Processes with Risk Probability Criteria[J]. Chinese Journal of Applied Probability and Statistics, 2023, 39(4): 589-603. DOI: 10.3969/j.issn.1001-4268.2023.04.009
Citation: WEN Xin, XU Xiaoya, GUO Xianping. Nonstationary Markov Decision Processes with Risk Probability Criteria[J]. Chinese Journal of Applied Probability and Statistics, 2023, 39(4): 589-603. DOI: 10.3969/j.issn.1001-4268.2023.04.009

Nonstationary Markov Decision Processes with Risk Probability Criteria

  • This paper considers a risk probability minimization problem for nonstationary discrete-time Markov decision processes, in which the transition probabilities and the reward functions depend on time. Different from the expected reward/cost criteria in the existing literature, the optimality performance here is to minimize the probability that the total rewards do not reach a given profit goal until the first passage time to some target set. Under mild reasonable conditions, we establish the corresponding optimality equations, verify that the sequence of the optimal risk functions is the unique solution to the optimality equations, and prove the existence of an optimal Markov policy.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return