University of Cambridge > Talks.cam > CuAI (Cambridge University Artificial Intelligence Society) > Language in Vision: Visual Reasoning, Visual Navigation, Grounded Translation

Language in Vision: Visual Reasoning, Visual Navigation, Grounded Translation

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact srj38.

With deep learning and large-scale datasets, we can work on increasingly more ambitious computer vision problems and multimodal settings. In this presentation, I will talk about three computer vision problems where natural or synthetic language has played a fundamental role. Those problems are: 1) visual reasoning that takes the form of question answering about visual scenes, 2) visual navigation where the language provides instructions for the agent to follow, 3) grounded translation where, for a change, vision may turn out to be important for training translation models.

This talk is part of the CuAI (Cambridge University Artificial Intelligence Society) series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity