Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Robust speech processing on multi-talker mixed speech

Add to your list(s) Download to your calendar using vCal

Yanmin Qian, Shanghai Jiao Tong University
Friday 10 May 2019, 11:00-12:00
Department of Engineering - James Dyson Building Seminar Room.

If you have a question about this talk, please contact Yu Wang.

Although the performance of speech processing systems has surpassed the threshold for adoption for many real-world applications, e.g. speech recognition, speaker recognition, and speech enhancement, etc., there are still difficulties to be solved to make intelligent speech technologies more widely deployed. The cocktail party problem, i.e., tracing and recognizing the speech from a specific speaker when multiple speakers talk simultaneously and when other background noise is involved, is one such problem. The cocktail party problem has been wideyl observed. Solving it could enable many scenarios and applications, such as meeting transcription, multi-party human–machine interaction, and hearing impairment assistants, where overlapped speech cannot be ignored. In this talk we will first briefly review the techniques proposed in the last two decades in attacking this problem. Then we will present techniques, which are developed in SJTU SpeechLab during the recent years, to improve the performance of speech processing system in the cocktail party environment. Two tasks are mainly focused, including multi-talker speech separation and multi-talker speech recognition. The newly proposed approaches significantly boost the separation quality and recognition accuracy on the multi-talker mixed speech processing. Detailed comparison and promising results will be given on some benchmark tasks.

This talk is part of the CUED Speech Group Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Robust speech processing on multi-talker mixed speech

This talk is included in these lists:

Other lists

Other talks