CUED Publications database

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training

Zhang, C and Woodland, PC (2017) Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training. In: UNSPECIFIED pp. 5015-5019..

Full text not available from this repository.

Abstract

© 2017 IEEE. The use of deep neural networks (DNNs) for feature extraction and Gaussian mixture models (GMMs) for acoustic modelling is often termed a tandem system configuration and can be viewed as a Gaussian mixture density neural network (MDNN). Compared to the direct use of DNN output probabilities in the acoustic model, the tandem approach suffers from a major weakness in that the feature extraction stage and the final acoustic models are optimised separately. This paper proposes a joint optimisation approach to all the stages of the tandem acoustic model by using MDNN discriminative sequence training. A set of techniques is used to improve the training performance and stability. Experiments using the multi-genre broadcast (MGB) English data show that the proposed method produced a 6% relative lower word error rate (WER) than that of a traditional discriminatively trained tandem system. The resulting jointly optimised tandem systems are comparable in WER to hybrid DNN systems optimised using discriminative sequence training with the same number of parameters.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Subjects: UNSPECIFIED
Divisions: Div F > Machine Intelligence
Depositing User: Cron Job
Date Deposited: 30 Aug 2017 20:06
Last Modified: 05 Sep 2017 01:51
DOI: