Vol. 21 No. 2 (2011)
Artículos de Investigación

Nuevo método de aceleración de los procesos de decisión de Markov

Published 2011-08-01

Keywords

  • Markov decision processes,
  • topological ordering,
  • shortest path.
  • Procesos de decisión de Markov,
  • ordenamiento topológico,
  • ruta más corta.

How to Cite

García-Hernández, M. de G., Ruiz-Pinales, J., Ledesma-Orozco, S., Aviña-Cervantes, J. G., & Alvarado-Méndez, E. (2011). Nuevo método de aceleración de los procesos de decisión de Markov. Acta Universitaria, 21(2), 50–57. https://doi.org/10.15174/au.2011.38

Abstract

In this paper we propose a new acceleration method for solving Markov decision processes. Value iteration is a classical algorithm for solving Markov decision processes, but this algorithm and its variants are quite slow for discount factors close to one and their convergence properties depend to a great extent on a good state update order. Recently, it has been shown that improved topological value iteration presents a good convergence speed thanks to the use of an improved topological ordering algorithm. Nevertheless, the drawback of this algorithm is due to its memory requirements. So, we present a different method to obtain a good state backup order with less memory requirements. Experimental results obtained on a stochastic shortest path problem are presented.