| COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. | ![]() |
University of Cambridge > Talks.cam > CUED Speech Group Seminars > Reducing Speaker and Temporal Redundancy in Discrete Speech Tokenization
Reducing Speaker and Temporal Redundancy in Discrete Speech TokenizationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Brian Sun. Discrete speech tokens have emerged as a fundamental representation for various downstream speech processing tasks, particularly in speech generation. However, most existing tokens encode dense, fixed-rate acoustic information, which introduces substantial redundancy and limits their efficiency. In this talk, I will first provide a brief review on the taxonomy of current discrete speech tokens, then present our works exploring the reduction of this information redundancy in two critical directions: (1) Speaker timbre disentanglement, introducing a low-bitrate, single-codebook and speaker-decoupled codec for speech. (2) Variable-rate temporal compression, exploring methods to dynamically adjust the frame rate of discrete tokens for better compactness and bitrate-performance tradeoff. Together, these efforts highlight pathways toward more efficient and controllable discrete speech representations, paving the way for the next generation of speech technologies. This talk is part of the CUED Speech Group Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsKing's Silk Roads Lister Institute Research Prize Fellowship Talks WHPC Cambridge & East Anglia, and RSEEE Kick-off eventOther talksThe Largest Air-Conditioned Room in the World: Houston and the History of Climate Change Bridging the gap between isotope geochemistry and plant sciences A Highly Efficient Machine Learning-Based Ozone Parameterization for Climate Sensitivity Simulations 100 years of educational trials – no significant difference? Climate Modelling Mapping the Cosmic Web in Lyα emission around quasars and non-AGN galaxies |