CUED Publications database

The CU-HTK Mandarin broadcast news transcription system

Sinha, R and Gales, MJF and Kim, DY and Liu, XA and Sim, KC and Woodland, PC (2006) The CU-HTK Mandarin broadcast news transcription system. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1. I1077-I1080. ISSN 1520-6149

Full text not available from this repository.

Abstract

This paper discusses the development of the CU-HTK Mandarin Broadcast News (BN) transcription system. The Mandarin BN task includes a significant amount of English data. Hence techniques have been investigated to allow the same system to handle both Mandarin and English by augmenting the Mandarin training sets with English acoustic and language model training data. A range of acoustic models were built including models based on Gaussianised features, speaker adaptive training and feature-space MPE. A multi-branch system architecture is described in which multiple acoustic model types, alternate phone sets and segmentations can be used in a system combination framework to generate the final output. The final system shows state-of-the-art performance over a range of test sets. ©2006 British Crown Copyright.

Item Type: Article
Subjects: UNSPECIFIED
Divisions: Div F > Machine Intelligence
Depositing User: Cron Job
Date Deposited: 07 Mar 2014 11:53
Last Modified: 27 Nov 2014 19:19
DOI: