Automatically Creating Reading Lists with Topical PageRank
- đ¤ Speaker: James Jardine, University of Cambridge
- đ Date & Time: Friday 24 February 2012, 12:00 - 13:00
- đ Venue: FW26, Computer Laboratory
Abstract
We present an algorithm for creating reading lists – lists of papers given by an expert to a novice, designed to bring the novice up to speed in a certain area. Our algorithm uses a variant of PageRank that is age-corrected and sensitive to the mixture of papers’ topics as determined by the LDA topic model. When compared to a gold standard of reading lists which we collected from experts, our algorithm outperforms three currently used keyword-based search engines: Lucene, Google Scholar and the Google-indexed ACL Anthology. As evaluation metrics we use F-measure, as well as a new evaluation metric specific to reading lists which we introduce here. It estimates the degree of substitutability of expert papers by system-found ones by the number of links in the citation network between them. We also evaluate on the task of reference list reintroduction. When reintroducing the reference list of thousands of papers, our unsupervised algorithm performs on a par with the current state-of-the-art method, which is supervised.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

James Jardine, University of Cambridge
Friday 24 February 2012, 12:00-13:00