Singh, SS and Tadic, V and Doucet, A (2005) A policy gradient method for semi-Markov decision processes with application to call admission control. Technical Report. Cambridge University Engineering Department, Cambridge, UK.Full text not available from this repository.
|Item Type:||Monograph (Technical Report)|
|Divisions:||Div F > Signal Processing and Communications|
|Depositing User:||Unnamed user with email email@example.com|
|Date Deposited:||18 May 2016 18:22|
|Last Modified:||27 Jul 2016 23:35|