Gradient-based policy iteration: an example - Hong Kong University of ...

6 downloads 70 Views 353KB Size Report
ent states are correlated and hence the standard policy iteration cannot apply. We illustrate the main ideas with an example of M/G/l/N queue and identify some.