COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Signal Processing and Communications Lab Seminars > Deep Learning for Multifarious Speech Processing: Tackling Multiple Speakers, Microphones, and Languages
Deep Learning for Multifarious Speech Processing: Tackling Multiple Speakers, Microphones, and LanguagesAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Prof. Ramji Venkataramanan. This talk has been canceled/deleted Speech processing has been at the forefront of the recent deep learning revolution, with major breakthroughs in automatic speech recognition, speech enhancement, and source separation. I will give an overview of deep learning techniques developed at MERL towards the goal of cracking the Tower of Babel version of the cocktail party problem, that is, separating and/or recognizing the speech of multiple unknown speakers speaking simultaneously in multiple languages. I will also attempt to present live demonstrations with audience participation (weather, time, and network conditions permitting). Bio: Jonathan Le Roux is a Senior Principal Research Scientist and the Speech and Audio Team Leader at Mitsubishi Electric Research Laboratories (MERL) in Cambridge, Massachusetts. He completed his B.Sc. and M.Sc. degrees in Mathematics at the Ecole Normale Supérieure (Paris, France), his Ph.D. degree at the University of Tokyo (Japan) and the Université Pierre et Marie Curie (Paris, France), and worked as a postdoctoral researcher at NTT ’s Communication Science Laboratories from 2009 to 2011. His research interests are in signal processing and machine learning applied to speech and audio. He has contributed to more than 80 peer-reviewed papers and 20 patents in these fields. He is a founder and chair of the Speech and Audio in the Northeast (SANE) series of workshops, a Senior Member of the IEEE , and a member of the IEEE Audio and Acoustic Signal Processing Technical Committee (AASP). This talk is part of the Signal Processing and Communications Lab Seminars series. This talk is included in these lists:This talk is not included in any other list Note that ex-directory lists are not shown. |
Other listsEconomics and Philosophy List 1 Three-dimensional cell culture: Innovations in tissue scaffolds and biomimetic systemsOther talksPSEUDO-DRAG OF A POLARITON SUPERFLUID Odd elasticity in soft active solids Talk 1. Using immersive reality to examine the U-shaped relationship between schema and memory performance Talk 2. Multivariate approaches to understanding the brain-behaviour relationships in cognitive ability Perioperative Communication and Decision Making: A social science perspective Linking permit markets multilaterally Data-Enabled Predictive Control of Autonomous Energy Systems |