Abstract for liao_tr499

Cambridge University Engineering Department Technical Report CUED/F-INFENG/TR499

UNCERTAINTY DECODING FOR NOISE ROBUST SPEECH RECOGNITION

Hank Liao and Mark Gales

October 2004

This report presents uncertainty decoding as a method for robust automatic speech recognition for the Noise Robust Automatic Speech Recognition project funded by Toshiba Research Europe Limited. The effects of noise on speech recognition are reviewed and a general framework for noise robust speech recognition introduced. Common and related noise robustness techniques are described in the context of this framework. Uncertainty decoding is also presented in this framework with the goal of providing fast noise compensation through the propagation of uncertainty to the decoder. Two forms are discussed, the Joint and SPLICE methods, and evaluated on the medium vocabulary Resource Management corpus at a range of arti^Lcially produced noise levels. It was found that the uncertainty decoding algorithms did not meet the performance of a matched system, but were more accurate than the baseline SPLICE enhancement technique and low numbers of CMLLR transforms.

(ftp:) liao_tr499.pdf (http:) liao_tr499.pdf

If you have difficulty viewing files that end '.gz', which are gzip compressed, then you may be able to find tools to uncompress them at the gzip web site.

If you have difficulty viewing files that are in PostScript, (ending '.ps' or '.ps.gz'), then you may be able to find tools to view them at the gsview web site.

We have attempted to provide automatically generated PDF copies of documents for which only PostScript versions have previously been available. These are clearly marked in the database - due to the nature of the automatic conversion process, they are likely to be badly aliased when viewed at default resolution on screen by acroread.