Computational Methods for Linking Sets of National Files
If you have a question about this talk, please contact INI IT.

DLAW02 - Data linkage: techniques, challenges and applications

A combination of faster hardware and new computational algorithms makes it possible to link two or more national files that share suitable quasi-identifying information (name, address, date of birth and other fields that do not uniquely identify a person) far faster than the methods of a decade earlier. The methods (Winkler, Yancey, and Porter 2010) were used to match roughly 10^17 pairs (300 million x 300 million) on 40 CPUs of an SGI machine (with 2006 Itanium chips) in less than 30 hours during the 2010 U.S. Decennial Census. The methods are 50 times as fast as the PSwoosh parallel software (Kawai et al. 2006) from Stanford University, and about 10 times as fast as recent parallel software that applies new methods of load balancing (Rahm and Kolb 2013, Yan et al. 2013, Karapiperis and Verykios 2014). This talk will describe how the software bypasses the need for system sorts and provides highly optimized search-retrieval-comparison for the narrow range of situations that arise in record linkage.
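
As a rough illustration of what search-retrieval-comparison without a sort can look like, the sketch below indexes one file in memory by a blocking key and streams the other file past it, so neither file is ever sorted. This is not the software described in the talk: the record fields (id, surname, given, dob), the blocking key, the string comparator and the threshold are all assumptions chosen only to keep the example small.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def blocking_key(record):
    # Hypothetical blocking key: surname initial plus year of birth.
    # Only records that share this key are ever compared.
    return (record["surname"][:1].upper(), record["dob"][:4])


def build_index(records):
    # Hash index over one file: blocking key -> records with that key.
    # A hash lookup replaces the sort-and-merge step.
    index = defaultdict(list)
    for rec in records:
        index[blocking_key(rec)].append(rec)
    return index


def similarity(a, b):
    # Stand-in string comparator (assumption); real linkage systems
    # typically use comparators tuned for names.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def link(file_a, file_b, threshold=0.85):
    # Index file_a once, then stream file_b and compare each record
    # only against the candidates retrieved for its blocking key.
    index = build_index(file_a)
    matches = []
    for rec_b in file_b:
        for rec_a in index.get(blocking_key(rec_b), []):
            score = (similarity(rec_a["surname"], rec_b["surname"])
                     + similarity(rec_a["given"], rec_b["given"])) / 2
            if score >= threshold and rec_a["dob"] == rec_b["dob"]:
                matches.append((rec_a["id"], rec_b["id"], score))
    return matches


if __name__ == "__main__":
    file_a = [
        {"id": "a1", "surname": "Smith", "given": "John", "dob": "1970-01-02"},
        {"id": "a2", "surname": "Jones", "given": "Mary", "dob": "1985-05-05"},
    ]
    file_b = [
        {"id": "b1", "surname": "Smyth", "given": "John", "dob": "1970-01-02"},
        {"id": "b2", "surname": "Brown", "given": "Ann", "dob": "1990-03-03"},
    ]
    for match in link(file_a, file_b):
        print(match)
```

Blocking matters because comparing every record in one 300-million-record file with every record in another means about 9 x 10^16 comparisons; restricting comparisons to records that share a key reduces the work to the candidate pairs actually retrieved from the index.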
This talk is part of the Isaac Newton Institute Seminar Series.