Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Learning to see with deep learning architectures for localisation and scene understanding

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Mariana Marasoiu.

Abstract

We can now teach machines to recognize objects. However, in order to teach a machine to “see” we need to understand geometry as well as semantics. Given an image of a road scene, for example, an autonomous vehicle needs to determine where it is, what’s around it, and what’s going to happen next? This requires not only object recognition, but depth, motion and spatial perception, and instance-level identification. We present work towards solving these problems using deep learning.

The first, SegNet, is a deep convolutional network architecture designed to map input RGB images to pixel labels for scene understanding. It is composed of an encoder network and a decoder network which ends with a softmax classifier. The entire architecture can be trained end-to-end using stochastic gradient descent. SegNet can produce a dense pixel-wise output in real-time with a measure of model uncertainty. We show SegNet applied to both classification and regression tasks.

Secondly, PoseNet is a real-time relocalisation system. We show how to train very deep networks to regress the camera’s 3D position and orientation from a single image. The algorithm can operate over large scale indoor and outdoor areas in real time.

Live web demonstrations and links to publications can be found on our project webpages: http://mi.eng.cam.ac.uk/projects/segnet/ http://mi.eng.cam.ac.uk/projects/relocalisation/

This talk is part of the Rainbow Group Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Learning to see with deep learning architectures for localisation and scene understanding

This talk is included in these lists:

Other lists

Other talks