A Proposal on Evaluation Measures for RTE


If you have a question about this talk, please contact Laura Rimell.

We outline problems with the interpretation of accuracy in the presence of bias, arguing that the issue is a particularly pressing concern for RTE evaluation. Furthermore, we argue that average precision scores are unsuitable for RTE and should not be reported. We advocate mutual information as a new evaluation measure that should be reported in addition to accuracy and confidence-weighted score.
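As a rough illustration of why accuracy can mislead under bias, the sketch below compares accuracy with mutual information on two made-up confusion matrices. It assumes mutual information is estimated between gold and predicted labels from confusion-matrix counts; the abstract does not state the exact definition used in the talk, and the counts, function names, and the 80/20 label split are hypothetical.

```python
import math


def accuracy(confusion):
    """Fraction of items on the diagonal (rows = gold, columns = predicted)."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def mutual_information(confusion):
    """Plug-in estimate of I(gold; predicted) in bits from raw counts."""
    total = sum(sum(row) for row in confusion)
    row_sums = [sum(row) for row in confusion]
    col_sums = [sum(col) for col in zip(*confusion)]
    mi = 0.0
    for i, row in enumerate(confusion):
        for j, count in enumerate(row):
            if count == 0:
                continue  # empty cells contribute nothing
            # p(i, j) / (p(i) * p(j)), computed from counts
            ratio = count * total / (row_sums[i] * col_sums[j])
            mi += (count / total) * math.log2(ratio)
    return mi


# Hypothetical test set: 80 entailment pairs, 20 non-entailment pairs.
# Rows are gold labels, columns are predicted labels.
majority = [[80, 0], [20, 0]]   # always predicts "entailment"
informed = [[60, 20], [0, 20]]  # predictions that depend on the input

for name, matrix in [("majority", majority), ("informed", informed)]:
    print(name, accuracy(matrix), round(mutual_information(matrix), 3))
```

Both hypothetical systems reach the same accuracy, but the majority-class system conveys no information about the gold label, while the second system does. This is only one way to make the bias problem concrete, not the authors' own formulation.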

This talk is part of the NLIP Seminar Series.


© 2006-2024 Talks.cam, University of Cambridge.