Singh, SS and Tadic, V and Doucet, A (2005) A policy gradient method for semi-Markov decision processes with application to call admission control. Technical Report. Cambridge University Engineering Department, Cambridge, UK.Full text not available from this repository.
|Item Type:||Monograph (Technical Report)|
|Divisions:||Div F > Signal Processing and Communications|
|Depositing User:||Cron Job|
|Date Deposited:||09 Dec 2016 18:12|
|Last Modified:||23 Jan 2017 10:31|