COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > CuAI (Cambridge University Artificial Intelligence Society) > Language in Vision: Visual Reasoning, Visual Navigation, Grounded Translation
Language in Vision: Visual Reasoning, Visual Navigation, Grounded TranslationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact srj38. With deep learning and large-scale datasets, we can work on increasingly more ambitious computer vision problems and multimodal settings. In this presentation, I will talk about three computer vision problems where natural or synthetic language has played a fundamental role. Those problems are: 1) visual reasoning that takes the form of question answering about visual scenes, 2) visual navigation where the language provides instructions for the agent to follow, 3) grounded translation where, for a change, vision may turn out to be important for training translation models. This talk is part of the CuAI (Cambridge University Artificial Intelligence Society) series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsCUSynBioSoc Trinity College Science Society (TCSS) School of Technology Research Funding MasterclassesOther talksThe role of content-free pointers in online visual memory The Vagrancy of Economic Invisibility PP2A-B55 inhibitors Arpp19 and ENSA define the cell cycle program by controlling the temporal pattern of protein phosphorylation Protein complexes subjected to tandem mass spectrometry reveal allosteric binding partners Inference in Stochastic Processes Statistics Clinic Lent 2021 - Skype session IV |