COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > High-dimensional variable selection when features are sparse
High-dimensional variable selection when features are sparseAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact INI IT. STS - Statistical scalability It is common in modern prediction problems for many predictor variables to be counts of rarely occurring events. This leads to design matrices in which a large number of columns are highly sparse. The challenge posed by such “rare features” has received little attention despite its prevalence in diverse areas, ranging from biology (e.g., rare species) to natural language processing (e.g., rare words). We show, both theoretically and empirically, that not explicitly accounting for the rareness of features can greatly reduce the effectiveness of an analysis. We next propose a framework for aggregating rare features into denser features in a flexible manner that creates better predictors of the response. An application to online hotel reviews demonstrates the gain in accuracy achievable by proper treatment of rare words. This is joint work with Xiaohan Yan. This talk is part of the Isaac Newton Institute Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsCEB Career Talks CSaP Professional Development Policy Seminars Critical Theory and Practice SeminarOther talksUrinary Sphincter Mechanism Incompetence (USMI) treated with Static Hydraulic Urethral Sphincter (SHUS), complications and long-term outcome in female dogs The diversifying nature of impact - Publishing tips from 'Nature Biomedical Engineering' Magnetic-field Induced Pair Density Wave State in the Cuprate Vortex Halo Quantum Computing - Theory and Practice: Introduction and overview The DNA oxygenase TET1 in mammalian embryonic development and epigenetic reprogramming Full-field optical metrology |