Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Factors Affecting ASR Model Self-Training

Add to your list(s) Download to your calendar using vCal

Scott Novotney (HLTCOE and BBN Technologies)
Tuesday 01 September 2009, 11:00-12:00
LR5, Engineering Department, Baker Building.

If you have a question about this talk, please contact Dr Marcus Tomalin.

Low-resource ASR self-training seeks to minimize resource requirements such as manual transcriptions or language modeling text. This is accomplished by training on large quantities of audio automatically labeled by a small initial model. By analyzing our previous experiments with the conversational telephone English Fisher corpus, we demonstrate where self-training succeeds and under what resource conditions it provides the most benefit. Additionally, we will show success on Spanish and Levantine conversational speech as well as the tougher English Callhome set, despite initial WER of more than 60%. Finally, by digging beneath average word error rate and analyzing individual word performance, we show that self-trained models successfully learn new words. More importantly, self-training benefits most words which appear in the unlabeled audio but do not appear in the manual transcriptions.

This talk is part of the Machine Intelligence Laboratory Speech Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Factors Affecting ASR Model Self-Training

This talk is included in these lists:

Other lists

Other talks