University of Cambridge > Talks.cam > CUED Speech Group Seminars > Towards understanding the performance of individuals within automatic speaker recognition systems

Towards understanding the performance of individuals within automatic speaker recognition systems

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Simon Webster McKnight.

Automatic speaker recognition (ASpR) systems are widely used in commercial settings for the purposes of personalisation and security, and are now increasingly being used to evaluate forensic evidence presented in court. Despite improvements in overall performance, even with forensically realistic materials (Morrison and Enzinger 2019), very little is known about why certain voices would be easy or difficult for systems to recognise.

In this talk, I will introduce a current ESRC -funded project (Person-specific ASR ) which attempts to better understand and explain individual variability in ASpR performance, with a particular focus on forensic applications. I will present data from a series of studies using both a small-scale, highly controlled dataset of recordings containing extreme vocal variation, as well as a large-scale, forensically realistic database provided to us by the UK Government. Finally, I will discuss our initial attempts to handle problematic speakers for ASpR systems, through condition adaptation and tailored calibration.

This talk is part of the CUED Speech Group Seminars series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity