Seigel, MS and Woodland, PC (2011) Combining information sources for confidence estimation with CRF models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. pp. 905-908.
Full text not available from this repository.
Obtaining accurate confidence measures for automatic speech recognition (ASR) transcriptions is an important task which stands to benefit from the use of multiple information sources. This paper investigates the application of conditional random field (CRF) models as a principled technique for combining multiple features from such sources. A novel method for combining suitably defined features is presented, allowing for confidence annotation using lattice-based features of hypotheses other than the lattice 1-best. The resulting framework is applied to different stages of a state-of-the-art large vocabulary speech recognition pipeline, and consistent improvements are shown over a sophisticated baseline system. Copyright © 2011 ISCA.
|Uncontrolled Keywords:||ASR system combination, Conditional random fields, Confidence estimation, Speech recognition|
|Divisions:||Div F > Machine Intelligence|
|Depositing User:||Cron job|
|Date Deposited:||04 Feb 2015 23:02|
|Last Modified:||05 Feb 2015 07:09|