COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > CUED Speech Group Seminars > End-to-end contextual speech recognition with Tree-constrained pointer generator
End-to-end contextual speech recognition with Tree-constrained pointer generatorAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Simon Webster McKnight. Contextual knowledge is of vital importance to end-to-end automatic speech recognition (ASR) systems, especially for the long-tailed word problem where systems suffer from degraded performance on rare or unseen words that are both relevant to the context and carrying important information. Integrating such contextual knowledge into such end-to-end systems is both necessary and challenging, as contextual knowledge is always dynamically changing while neural systems adopt a static set of trained parameters. In ASR , dynamic contextual knowledge is often incorporated via contextual biasing, where a list of rare words or phrases that are likely to appear in a given context is included, denoted as a biasing list of biasing words. A word is more likely to be correctly recognised if it is incorporated into the biasing list. This talk introduces tree-constrained pointer generator (TCPGen) as an effective neural-based biasing component for end-to-end contextual ASR . TCPGen effectively integrate contextual knowledge via a pointer generator mechanism, and efficiently structures biasing lists into prefix-trees. This talk includes the detailed TCP Gen approach, the use of graph neural networks for tree encodings and its application to Whisper models. This talk is part of the CUED Speech Group Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsCUSPE Forum Violence and Conflict Graduate Workshop, Faculty of History Philomathia Social Sciences Research ProgrammeOther talksThicker Than Blood: Kinship, Its Multiplicity, and Entanglements - a Cambridge Reproduction Forum Shock and Impact Deformation: A Geological Perspective Human Remains from Pompeii. Excavation, research and valorisation The quest for the first stars and first black holes with the James Webb Space Telescope Learning to Receive Help: Intervention-Aware Concept Embedding Models Reproductive justice in the climate crisis - Seminar Three |