Abstract
The authors consider a singularly perturbed Markov decision process (MDP) with the limiting average cost criterion. It is assumed that the underlying process is composed of n separate irreducible processes, and that the small perturbation is such that it 'unites' these processes into a single irreducible process. This structure corresponds to the Markov chains admitting strong and weak interactions. The authors introduce the formulation and some results given by T. R. Bielecki and J. A. Filar (1989) for the underlying control problem for the singularly perturbed MDP, the so-called limit Markov control problem (limit MCP). It is demonstrated here that the limit MCP can be solved by a suitably constructed linear program. An algorithm for solving the limit MCP based on the policy improvement method is constructed.
Original language | English |
---|---|
Pages (from-to) | 1402-1407 |
Number of pages | 7 |
Journal | Proceedings of the IEEE Conference on Decision and Control |
Volume | 3 |
DOIs | |
Publication status | Published - 1990 |
Externally published | Yes |
Event | 29th IEEE Conference on Decision and Control - Honolulu, HI, USA Duration: 5 Dec 1990 → 7 Dec 1990 |