Singh, SS and Tadic, V and Doucet, A (2005) A policy gradient method for semi-Markov decision processes with application to call admission control. Technical Report. Cambridge University Engineering Department, Cambridge, UK.Full text not available from this repository.
|Item Type:||Monograph (Technical Report)|
|Divisions:||Div F > Signal Processing and Communications|
|Depositing User:||Cron Job|
|Date Deposited:||15 Dec 2015 14:00|
|Last Modified:||06 Feb 2016 02:34|