Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation

Add to your list(s) Download to your calendar using vCal

DR. Tom Minka (Microsoft Research Cambridge)
Thursday 19 May 2011, 14:00-15:30
Engineering Department, CBL Room 438.

If you have a question about this talk, please contact Konstantina Palla.

This reading group meeting will discuss the following paper:

“On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation”
by Gavin C. Cawley and Nicola L. C. Talbot
Journal of Machine Learning Research 11 (2010) 2079-2107
http://jmlr.csail.mit.edu/papers/v11/cawley10a.html

Model selection strategies for machine learning algorithms typically involve the numerical optimisation of an appropriate model selection criterion, often based on an estimator of generalisation performance, such as k-fold cross-validation. The error of such an estimator can be broken down into bias and variance components. While unbiasedness is often cited as a beneficial quality of a model selection criterion, we demonstrate that a low variance is at least as important, as a non-negligible variance introduces the potential for over-fitting in model selection as well as in training the model. While this observation is in hindsight perhaps rather obvious, the degradation in performance due to over-fitting the model selection criterion can be surprisingly large, an observation that appears to have received little attention in the machine learning literature to date. In this paper, we show that the effects of this form of over-fitting are often of comparable magnitude to differences in performance between learning algorithms, and thus cannot be ignored in empirical evaluation. Furthermore, we show that some common performance evaluation practices are susceptible to a form of selection bias as a result of this form of over-fitting and hence are unreliable. We discuss methods to avoid over-fitting in model selection and subsequent selection bias in performance evaluation, which we hope will be incorporated into best practice. While this study concentrates on cross-validation based model selection, the findings are quite general and apply to any model selection practice involving the optimisation of a model selection criterion evaluated over a finite sample of data, including maximisation of the Bayesian evidence and optimisation of performance bounds.

This talk is part of the Machine Learning Reading Group @ CUED series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation

This talk is included in these lists:

Other lists

Other talks