Abstract
Linear programming formulations for the discounted and long-run average MDPs have evolved along separate trajectories. In 2006, E. Altman conjectured that the two linear programming formulations of discounted and long-run average MDPs are, most likely, a manifestation of general properties of singularly perturbed linear programs. In this note we demonstrate that this is, indeed, the case.
| Original language | English |
|---|---|
| Pages (from-to) | 297-301 |
| Number of pages | 5 |
| Journal | Operations Research Letters |
| Volume | 44 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 1 May 2016 |
Keywords
- Discounted MDPs
- Limiting linear program
- Long-run average MDPs
- Markov Decision Processes (MDPs)
- Singularly perturbed linear programs