Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Confidence estimation for attention-based encoder-decoder models for speech recognition

Add to your list(s) Download to your calendar using vCal

Qiujia Li (University of Cambridge)
Monday 06 June 2022, 12:00-13:00
Zoom: https://eng-cam.zoom.us/j/81927138251?pwd=TVd3MXliV003dUdYVlFwU2NDWGpmdz09.

If you have a question about this talk, please contact Dr Jie Pu.

This talk will be both online (zoom) and offline (CBL Meeting Room)

Abstract: Confidence scores have been an intrinsic part of a conventional speech recogniser. As end-to-end ASR models such as attention-based encoder-decoder models become increasingly popular, it is of great interest to develop reliable confidence estimators for various downstream tasks. In this talk, I will present the confidence estimation module (CEM) for token/word-level confidence scores, and the residual energy-based model (R-EBM) for utterance-level confidence scores for attention-based models. Interestingly, R-EBM can also help improve the ASR performance. Furthermore, some effective techniques for generalising these model-based confidence estimators to out-of-domain data will be discussed.

Bio: Qiujia Li is a fourth-year PhD student at the University of Cambridge, advised by Prof. Phil Woodland. He obtained his BA and MEng also from Cambridge University. His research interests lie primarily in speech processing and machine learning, including end-to-end speech recognition, confidence estimation and speaker diarization. He has published more than a dozen papers at ICASSP , Interspeech, SLT , ASRU, NeurIPS and ICCV , of which two won the best student paper awards at ASRU 2019 and SLT 2021 . He previously worked as a research intern with Microsoft in 2018 and Google in 2020.

This talk is part of the CUED Speech Group Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Confidence estimation for attention-based encoder-decoder models for speech recognition

This talk is included in these lists:

Other lists

Other talks