Aggregation-disaggregation algorithm for ε2-singularly perturbed limiting average Markov control problems

Mohammed Abbad, Jerzy A. Filar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Finite state and action Markov decision processes (MDPs) are dynamic, stochastic systems controlled by a controller. These models are usually referred to as Markovian control problems (MCPs). The authors consider a singular perturbation of order 2 for a Markov decision process with the limiting average reward criterion. They define a singular perturbation of order 2 in the following sense: it is assumed that the underlying process is composed of n separate irreducible processes, and that a small ε-perturbation is such that it unites these processes into m separate irreducible processes. Then another small ε2-pertubration is such that it unites these latter processes into a single irreducible process. The singular perturbation of order 2 is formulated. The limit MCP that is entirely different from the original unperturbed MDP, which forms an appropriate asymptotic approximation to a whole family of perturbed problems, is given explicitly. Thus, only the single limit MCP needs to be solved. An aggregation-disaggregation algorithm is constructed for solving the limit MCP.

Original languageEnglish
Title of host publicationProceedings of the 30th IEEE Conference on Decision and Control
PublisherInstitute of Electrical and Electronics Engineers
Pages465-470
Number of pages6
ISBN (Print)0780304500
DOIs
Publication statusPublished - Jan 1992
Externally publishedYes
Event30th IEEE Conference on Decision and Control - Brighton, United Kingdom
Duration: 11 Dec 199113 Dec 1991

Publication series

NameProceedings of the IEEE Conference on Decision and Control
ISSN (Print)0191-2216

Conference

Conference30th IEEE Conference on Decision and Control
Country/TerritoryUnited Kingdom
CityBrighton
Period11/12/9113/12/91

Fingerprint

Dive into the research topics of 'Aggregation-disaggregation algorithm for ε2-singularly perturbed limiting average Markov control problems'. Together they form a unique fingerprint.

Cite this