Singh, SS and Tadic, V and Doucet, A (2005) A policy gradient method for semi-Markov decision processes with application to call admission control. Technical Report. Cambridge University Engineering Department, Cambridge, UK.

Full text not available from this repository.
|Item Type:|Monograph (Technical Report)|
|Divisions:|Div F > Signal Processing and Communications|
|Depositing User:|Cron Job|
|Date Deposited:|18 May 2016 18:22|
|Last Modified:|24 Aug 2016 03:16|