COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Microsoft Research Cambridge, public talks > How to force unsupervised neural networks to discover the right representation of images
How to force unsupervised neural networks to discover the right representation of imagesAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Microsoft Research Cambridge Talks Admins. One appealing way to design an object recognition system is to define objects recursively in terms of their parts and the required spatial relationships between the parts and the whole. These relationships can be represented by the coordinate transformation between an intrinsic frame of reference embedded in the part and an intrinsic frame embedded in the whole. This transformation is unaffected by the viewpoint so this form of knowledge about the shape of an object is viewpoint invariant. A natural way for a neural network to implement this knowledge is by using a matrix of weights to represent each part-whole relationship and a vector of neural activities to represent the pose of each part or whole relative to the viewer. The pose of the whole can then be predicted from the poses of the parts and, if the predictions agree, the whole is present. This leads to neural networks that can recognize objects over a wide range of viewpoints using neural activities that are ``equivariant’’ rather than invariant: as the viewpoint varies the neural activities all vary even though the knowledge is viewpoint-invariant. The ``capsules’’ that implement the lowest-level parts in the shape hierarchy need to extract explicit pose parameters from pixel intensities and these pose parameters need to have the right form to allow coordinate transformations to be implemented by matrix multiplies. These capsules are quite easy to learn from pairs of transformed images if the neural net has direct, non-visual access to the transformations, as it would if it controlled them. (Joint work with Sida Wang and Alex Krizhevsky) This talk is part of the Microsoft Research Cambridge, public talks series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsepigenetic club Logic & Semantics for Dummies Cambridge Global Food SecurityOther talksHorizontal transfer of antimicrobial resistance drives multi-species population level epidemics Stopping the Biological Clock – The Lazarus factor and Pulling Life back from the Edge. Atmospheric Retrieval In search of amethysts, black gold and yellow gold Borel Local Lemma Liver Regeneration in the Damaged Liver Understanding mechanisms and targets of malaria immunity to advance vaccine development Autumn Cactus & Succulent Show Scale and anisotropic effects in necking of metallic tensile specimens 'The Japanese Mingei Movement and the art of Katazome' Highly Energy Efficient Key-value Store for In-network Computing |