Modern Bayesian Record Linkage: Some Recent Developments and Open Challenges

If you have a question about this talk, please contact INI IT.

DLAW01 - 'Data linkage and anonymisation: setting the agenda'

Record linkage, also known as de-duplication, entity resolution, and coreference resolution, is the process of merging noisy databases to remove duplicate entities. Record linkage is becoming increasingly essential in the age of big data, where duplicates are ever present in applications such as official statistics, human rights, genetics, and electronic medical data. We briefly review the genesis of record linkage with the work of Newcombe in 1959, and then move to recent Bayesian developments that use novel clustering approaches. We discuss challenges that have recently been overcome and those that remain open and need guidance and attention.

This talk is part of the Isaac Newton Institute Seminar Series.
