COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Cambridge Analysts' Knowledge Exchange > Random projection ensemble classification
Random projection ensemble classificationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Adam Kashlak. We introduce a very general method for high-dimensional classification, based on careful combination of the results of applying an arbitrary base classifier to random projections of the feature vectors into a lower-dimensional space. In one special case presented here, the random projections are divided into non-overlapping blocks, and within each block we select the projection yielding the smallest estimate of the test error. Our random projection ensemble classifier then aggregates the results of applying the base classifier on the selected projections, with a data-driven voting threshold to determine the final assignment. Our theoretical results elucidate the effect on performance of increasing the number of projections. Moreover, under a boundary condition implied by the sufficient dimension reduction assumption, we control the test excess risk of the random projection ensemble classifier. A simulation comparison with several other popular high-dimensional classifiers reveals its excellent finite-sample performance. This is joint work with Richard Samworth. This talk is part of the Cambridge Analysts' Knowledge Exchange series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsInternational Political Theory Research Seminar Series david brailsford Type the title of a new list here Faculty of Education Special Events Thin Film Magnetism Group Seminars Isaac Newton Institute Seminar SeriesOther talksDevelopment of machine learning based approaches for identifying new drug targets TBC UK 7T travelling-head study: pilot results Investigation into appropriate statistical models for the analysis and visualisation of data captured in clinical trials using wearable sensors |