Singh, SS and Tadic, V and Doucet, A (2005) A policy gradient method for semi-Markov decision processes with application to call admission control. Technical Report. Cambridge University Engineering Department, Cambridge, UK.Full text not available from this repository.
|Item Type:||Monograph (Technical Report)|
|Divisions:||Div F > Signal Processing and Communications|
|Depositing User:||Cron Job|
|Date Deposited:||02 Sep 2016 17:21|
|Last Modified:||01 Dec 2016 08:21|