Mairesse, F and Gašić, M and Jurčíček, F and Keizer, S and Thomson, B and Yu, K and Young, S (2010) Phrase-based statistical language generation using graphical models and active learning. ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. pp. 1552-1561.Full text not available from this repository.
Most previous work on trainable language generation has focused on two paradigms: (a) using a statistical model to rank a set of generated utterances, or (b) using statistics to inform the generation decision process. Both approaches rely on the existence of a handcrafted generator, which limits their scalability to new domains. This paper presents BAGEL, a statistical language generator which uses dynamic Bayesian networks to learn from semantically-aligned data produced by 42 untrained annotators. A human evaluation shows that BAGEL can generate natural and informative utterances from unseen inputs in the information presentation domain. Additionally, generation performance on sparse datasets is improved significantly by using certainty-based active learning, yielding ratings close to the human gold standard with a fraction of the data. © 2010 Association for Computational Linguistics.
|Divisions:||Div F > Machine Intelligence|
|Depositing User:||Cron Job|
|Date Deposited:||18 May 2016 18:09|
|Last Modified:||25 Aug 2016 10:45|