COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Statistics > Frequency and cardinality recovery from sketched data: a novel approach bridging Bayesian and frequentist views
Frequency and cardinality recovery from sketched data: a novel approach bridging Bayesian and frequentist viewsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Qingyuan Zhao. We study how to recover the frequency of a symbol in a large discrete data set, using only a (lossy) compressed representation, or sketch, of those data obtained via random hashing. This is a classical problem at the crossroad of computer science and information theory, with various algorithms available, such as the count-min sketch. However, these algorithms often assume that the data are fixed, leading to overly conservative and potentially inaccurate estimates when dealing with randomly sampled data. In this talk, we consider the sketched data as a random sample from an unknown distribution, and then we introduce novel estimators that improve upon existing approaches. Our method combines Bayesian nonparametric and classical (frequentist) perspectives, addressing their unique limitations to provide a principled and practical solution. Additionally, we extend our method to address the related but distinct problem of cardinality recovery, which consists of estimating the total number of distinct objects in the data set. We validate our method on synthetic and real data, comparing its performance to state-of-the-art alternatives. This talk is part of the Statistics series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsRelocating Human Origins - What if Adam lived in the forest? Zoology Postdoc Summer Seminar Series Topology SeminarOther talksAndrew Robinson on "Vital Signs: What has Semiotics to do with the Origins of Life?" HONORARY FELLOWS LECTURE - Every breath you take and every move you make - understanding cellular oxygen sensing mechanisms SCIENCE AND THE FUTURES OF MEDICINE One Day Meeting Sustainability at The Netherlands eScience Center Everything Everywhere All at Once: Holographic Entropy Inequalities, the Topology of Error Correction, Black Holes, Cubohemioctahedron, and (maybe) the Toric Code |