Language in Vision: Visual Reasoning, Visual Navigation, Grounded Translation
- đ¤ Speaker: Mateusz Malinowski (DeepMind)
- đ Date & Time: Friday 19 February 2021, 17:00 - 18:00
- đ Venue: https://cern.zoom.us/j/67127006104?pwd=SkwxUWN4Zm1ST3BEKzJTSGwycU53Zz09
Abstract
With deep learning and large-scale datasets, we can work on increasingly more ambitious computer vision problems and multimodal settings. In this presentation, I will talk about three computer vision problems where natural or synthetic language has played a fundamental role. Those problems are: 1) visual reasoning that takes the form of question answering about visual scenes, 2) visual navigation where the language provides instructions for the agent to follow, 3) grounded translation where, for a change, vision may turn out to be important for training translation models.
Series This talk is part of the CuAI (Cambridge University Artificial Intelligence Society) series.
Included in Lists
- CuAI (Cambridge University Artificial Intelligence Society)
- https://cern.zoom.us/j/67127006104?pwd=SkwxUWN4Zm1ST3BEKzJTSGwycU53Zz09
- Talks cs
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Mateusz Malinowski (DeepMind)
Friday 19 February 2021, 17:00-18:00