Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Towards Improving End-to-End Neural Diarization

Add to your list(s) Download to your calendar using vCal

Dr Federico Landini, Brno University of Technology
Tuesday 06 August 2024, 12:00-13:00
Hybrid: JDB Seminar Room, Engineering Department or Zoom: https://cam-ac-uk.zoom.us/j/88498768580?pwd=1zjqKCU8AiRcd7ZR6SXBTjc0ScElsc.1.

If you have a question about this talk, please contact Simon Webster McKnight.

Until recently, diarization systems were formed by different submodules like voice activity detection, embedding extraction and clustering of such embeddings. However, the last quinquennial has seen many developments in diarization towards end-to-end models. These models, unlike modular ones, are trained to optimize a diarization-related loss and provide a more straightforward inference. Nevertheless, end-to-end systems still pose certain challenges. In this talk, I will comment on some of the work I did addressing some of their problems regarding synthetic training data generation and handling variable numbers of speakers.

This talk is part of the CUED Speech Group Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Towards Improving End-to-End Neural Diarization

This talk is included in these lists:

Other lists

Other talks