Abstract
Linear programming formulations for the discounted and long-run average MDPs have evolved along separate trajectories. In 2006, E. Altman conjectured that the two linear programming formulations of discounted and long-run average MDPs are, most likely, a manifestation of general properties of singularly perturbed linear programs. In this note we demonstrate that this is, indeed, the case.
Original language | English |
---|---|
Pages (from-to) | 297-301 |
Number of pages | 5 |
Journal | Operations Research Letters |
Volume | 44 |
Issue number | 3 |
DOIs | |
Publication status | Published - 1 May 2016 |
Keywords
- Discounted MDPs
- Limiting linear program
- Long-run average MDPs
- Markov Decision Processes (MDPs)
- Singularly perturbed linear programs