University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > Probabilistic Record Linkage and Deduplication after Indexing, Blocking, and Filtering

Probabilistic Record Linkage and Deduplication after Indexing, Blocking, and Filtering

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact INI IT.

DLAW01 - 'Data linkage and anonymisation: setting the agenda'

When linking two databases (or deduplicating a single database) the number of possible links grows rapidly in the size of the databases under consideration, and in most applications it is necessary to first reduce the number of record pairs that will be compared. Spurred by practical considerations, a range of indexing or blocking methods have been developed for this task. However, methods for inferring linkage structure that account for indexing, blocking, and filtering steps have not seen commensurate development. I review the implications of indexing, blocking and filtering, focusing primarily on the popular Fellegi-Sunter framework and proposing a new model to account for particular forms of indexing and filtering. 

This talk is part of the Isaac Newton Institute Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity