COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > Candidates vs. Noises Estimation for Large Multi-Class Classification Problem
Candidates vs. Noises Estimation for Large Multi-Class Classification ProblemAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact INI IT. STSW04 - Future challenges in statistical scalability In practice, there has been sigificant interest in multi-class classification problems where the number of classes is large. Computationally such applications require statistical methods with run time sublinear in the number of classes. A number of methods such as Noise-Contrastive Estimation (NCE) and variations have been proposed in recent years to address this problem. However, the existing methods are not statistically efficient compared to multi-class logistic regression, which is the maximum likelihood estimate. In this talk, I will describe a new method called Candidate v.s. Noises Estimation (CANE) that selects a small subset of candidate classes and samples the remaining classes. We show that CANE is always consistent and computationally efficient. Moreover, the resulting estimator has low statistical variance approaching that of the maximum likelihood estimator, when the observed label belongs to the selected candidates with high probability. Extensive experimental results show that CANE achieves better prediction accuracy over a number of the state-of-the-art tree classifiers, while it gains significant speedup compared to standard multi-class logistic regression. This talk is part of the Isaac Newton Institute Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsFestival of Ideas: Spotlight Talks Institute of Metabolic Science Seminars Isolation and molecular identification of actinomycete microflora, of ... cat.inist.fr/?aModele=afficheN&cpsidt=17110743 de A BOUDEMAGH - 2005 - Cité 24 fois - Autres articles ... of some saharian soils of south east Algeria (Biskra, EL-OuedOther talksUQ: does it require efficient linear algebra? 38th Cambridge Epigenetics Seminar Causality and Streaming Data Ramble through my greenhouse and Automation Cancer and Metbolism 2018 Future Challenges for Data Analytics for Energy Management |