COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Microsoft Research Cambridge, public talks > Mid-level Likelihoods and Constraints for 3D Scene Interpretation
Mid-level Likelihoods and Constraints for 3D Scene InterpretationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Microsoft Research Cambridge Talks Admins. This event may be recorded and made available internally or externally via http://research.microsoft.com. Microsoft will own the copyright of any recordings made. If you do not wish to have your image/voice recorded please consider this before attending How do you infer the 3D properties of the world from a single image? This question has eluded researchers in psychology and computer vision for decades, and enabling machines to accomplish this task remains an open question. In this talk, I will present my research towards solving the 3D interpretation task. I will first talk about Data-Driven 3D Primitives, a new way of inferring surface normals / scene layout from a single image. These primitives are discovered from large-scale RGB -D data by optimizing two simple criteria: primitives should be visually discriminative and geometrically informative. I will show that a straightforward label-transfer inference approach on top of of these primitives produces state-of-the-art results on a complex and cluttered dataset, as well as effective cross-dataset performance. Local cues, however, are inadequate by themselves: scenes are highly constrained in structure. I will therefore also talk about my work on constraining the 3D interpretation of scenes via physical and functional reasoning. Specifically, I will present work on mid-level physical constraints for layout estimation in the form of the convex and concave edges from the classic line-labeling literature. Finally, I will discuss how recognizing humans in scenes, even with imprecise and noisy pose estimators, can provide valuable cues for scene geometry via functional reasoning. This talk is part of the Microsoft Research Cambridge, public talks series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsComputer Science Research Students mini-conference Type the title of a new list here The Forensic Use of Bioinformation Global Intellectual History Seminar Faculty Library Events (PPSIS) Medieval Archaeology Group Seminar SeriesOther talksSneks long balus Making a Crowdsourced Task Attractive: Measuring Workers Pre-task Interactions Real Time Tomography X-Ray Imaging System - Geometry Calibration by Optimisation The persistence and transience of memory White dwarfs as tracers of cosmic, galactic, stellar & planetary evolution Beacon Salon #7 Imaging Far and Wide EU LIFE Lecture - "Histone Chaperones Maintain Cell Fates and Antagonize Reprogramming in C. elegans and Human Cells" Retinal mechanisms of non-image-forming vision 'Politics in Uncertain Times: What will the world look like in 2050 and how do you know? 'Cryptocurrency and BLOCKCHAIN – PAST, PRESENT AND FUTURE' Glucagon like peptide-1 receptor - a possible role for beta cell physiology in susceptibility to autoimmune diabetes Joinings of higher rank diagonalizable actions |