University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > Sanitization for sequential data

Sanitization for sequential data

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact info@newton.ac.uk.

DLA - Data linkage and anonymisation

Organizations disseminate sequential data to support applications in domains ranging from marketing to healthcare. Such data are typically modeled as a collection of sequences, or a series of time-stamped events, and they are mined by data recipients aiming to discover actionable knowledge. However, the mining of sequential data may expose sensitive patterns that leak confidential knowledge, or lead to intrusive inferences about groups of individuals.   In this talk, I will review the problem and present two approaches that prevent it, while retaining the usefulness of data in mining tasks. The first approach is applicable to a collection of sequences and sanitizes sensitive patterns by permuting their events. The selected permutations avoid changes in the set of frequent non-sensitive patterns and in the ordering information of the sequences. The second approach is applicable to a series of time-stamped events and sanitizes sensitive events by deleting them from carefully selected time points. The deletion of events is guided by a model that captures changes to the probability distribution of events across the sequence.  

This talk is part of the Isaac Newton Institute Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2019 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity