COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
Boosting in Location SpaceAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Edward Rosten. Computer-based object detection promises to vastly change our lives. Robots will be able to map their environment and make sense of the world. Scientists will have a new pair of eyes to sift through terabytes of images of molecules, proteins, waterways, and galaxies to discover novel science. Most approaches to object detection use general purpose machine learning algorithms that optimize a non-spatial objective. The classic sliding window approach predicts the presence or absence of an object in every window of an image; this optimizes the classifier for detection but not for localization. In contrast, this thesis introduces a new boosting algorithm (LocationBoost) that operates over an entire image at all times during training, directly predicts object locations, and minimizes a spatial loss function that is strongly motivated by object detection. The research of this dissertation led to seven major contributions in object detection. First, since object detection is ill-posed, a universal evaluation metric cannot be meaningful for all recognition tasks. Instead, we clearly define three different problems in small object detection and devise metrics well-matched to them. Second, we introduce generative grammars to combine primitive image features into composite features. Composite features are more informative and lead to more accurate object detection. Third, AdaBoost is fragile in the presence of noisy and ambiguous training data but we made spatially exploitative adaptations to the learning algorithm to greatly improve learning stability. Fourth, a radically different boosting algorithm (LocationBoost) is proposed that directly locates centers of small objects, bypassing the need for bounding boxes. Instead of boosting classifiers that predict whether or not a patch contains an object, our new approach boosts object detectors that produce a list of predicted object locations. LocationBoost uses a new spatial loss function reflecting the intuition that large areas of background are uninteresting and not worth spending computational effort on. Fifth, LocationBoost is extended so it can predict bounding boxes that enclose objects. Sixth, a multi-scale variant of LocationBoost is proposed to enable the detection of both large and small objects in the same image. In this variant, we show how the structure of multi-scale detection can be exploited to greatly speed up training and detection. Seventh, we propose a new primitive image feature based on FAST corner detection that enables real-time object detection. This talk is part of the er258's list series. This talk is included in these lists:Note that ex-directory lists are not shown. |
Other listsType the title of a new list here Japanese Society in Cambridge ケンブリッジ日本人会 Semantics and Pragmatics Research GroupOther talksSneks long balus Multilingual Identities and Heterogeneous Language Ideologies in the New Latino Diaspora Kiwi Scientific Acceleration on FPGA Electron Catalysis Intravital Imaging – Applications and Image Analysis/ Information session on Borysiewicz Biomedical Sciences Fellowships Bayesian optimal design for Gaussian process model Discovering regulators of insulin output with flies and human islets: implications for diabetes and pancreas cancer Single Cell Seminars (August) 'Honouring Giulio Regeni: a plea for research in risky environments' XZ: X-ray spectroscopic redshifts of obscured AGN How to Design a 21st Century Economy - with Kate Raworth |