SEMI-MARKOV DECISION PROCESSES WITH POLYNOMIAL REWARD.

Zvi Rosberg

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

A semi-Markov decision process, with a denumerable multidimensional state space, is considered. At any given state only a finite number of actions can be taken to control the process. The immediate reward earned in one transition period is merely assumed to be bounded by a polynomial and a bound is imposed on a weighted moment of the next state reached in one transition. It is shown that under an ergodicity assumption there is a stationary optimal policy for the long-run average reward criterion. A queueing network scheduling problem, for which previous criteria are inapplicable, is given as an application.

Original languageEnglish
Pages (from-to)301-309
Number of pages9
JournalJournal of Applied Probability
Volume19
Issue number2
DOIs
StatePublished - 1 Jan 1982

ASJC Scopus subject areas

  • Statistics and Probability
  • Mathematics (all)
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'SEMI-MARKOV DECISION PROCESSES WITH POLYNOMIAL REWARD.'. Together they form a unique fingerprint.

Cite this