We consider a system with a finite number of states, 1, 2, ⋯, S. Periodically we observe the current state of the system and perform an action, a, from a finite set A of possible actions. As a joint ...
The main aim of the present work is to establish connections between the theory of dynamic programming and the statistical decision theory. The paper deals with a nonMarkovian dynamic programming ...