非对称超市模型的报酬过程与性能优化研究

Reward Processes and Performance Optimization in Asymmetric Supermarket Models

  • 摘要: 超市模型具有操作简单、反应快速、实时管控等优点而成为研究大型网络资源管理的一个重要数学工具, 它已经在物联网、云计算、云制造、大数据、交通运输、医疗卫生等重要实际领域中获得了极为广泛的应用. 目前, 非对称超市模型是这个研究方向上的一个重要课题.
    在本文中, 我们研究了一个非对称超市模型. 由于M个服务台不相同, 所以到达顾客的路径选择策略表现得较为复杂: 它不仅与队长和服务速度有关, 而且也与服务台的信誉有关. 为此, 我们利用决策方法构造了非对称超市模型的路径选择策略. 基于此, 我们利用马氏报酬过程及其优化技术, 建立了这个非对称超市模型的泛函报酬方程, 并给出了这些泛函报酬方程的一个值递推算法; 通过对这个报酬函数的一个相向优化, 提供了这类非对称超市模型研究中的一个性能评价准则. 为了理解非对称超市模型是如何通过客观条件与主观行为来实施对大型网络资源进行有效管控, 本文的研究方法与结果在这个方向上首次提供了一些必要的理论依据.

     

    Abstract: The supermarket model has been an important mathematical tool in the study of resource management in large-scale networks by means of some advantages, such as, simple operations, quick reaction, real-time management and control and so on. It is widely applied in internet of things, cloud computing, cloud manufacturing, big data, transportation, health care and other important practical fields. Up to now, analysis of the asymmetric supermarket models is an increasingly interesting topic in this area.
    In this paper, we analyze an asymmetric supermarket model. Because the M servers are different from each other, the routine selection policies of each customer become to have a complex structure, where not only are the routine selection policies related to the different queue lengths and the different service speeds among the M servers, but they are also related to the customer's preference for the M servers. For this, we set up several useful routine selection policies in terms of the decision-making methods. Based on this, we provide the Markov reward processes of the asymmetric supermarket model and establish the associated functional reward equations, give a useful value iterative algorithm for solving the functional reward equations, obtain a criterion of performance evaluation in the asymmetric supermarket model through a double-direction optimization, and show that the sequence of iterative reward functions is monotone and the value iterative algorithm is convergent. This paper provides new and useful highlight on understanding how the asymmetric supermarket model is applied to resource management and control in large-scale networks both from the objective conditions and from the subjective behavior. At the same time, the methodology and main results of this paper give some basic theory and techniques in the study of asymmetric supermarket models for the first time.

     

/

返回文章
返回