San Fermin: Aggregating Large Data Sets using Dynamic Binomial Trees
If you have a question about this talk, please contact Wenjun Hu.

Content aggregation is an important sub-problem in distributed monitoring, distributed database queries, and software debugging. In this problem, a large number of systems hold information, and the requester is interested not in the result from each individual machine but in the aggregated result from all machines. Existing solutions address the case where the aggregate data is small (typically only a few bytes) and usually aggregate it by running a multicast tree in reverse.

This talk describes a novel algorithm called San Fermin for aggregating large data sets. San Fermin returns the answer from more nodes, computes the result faster, and scales better than existing solutions. Our evaluation explores different aggregation techniques using mathematical modeling, simulation, and deployment of a prototype on PlanetLab. It shows that San Fermin scales as either the number of nodes or the data size increases. San Fermin is also highly resilient to failures: when 10% of the nodes fail during aggregation, it still returns the answer from over 97% of the nodes.

Bio: Justin Cappos is currently working on his Ph.D. at the University of Arizona with John Hartman and Beichuan Zhang. His research focuses on improving the security and efficiency of real-world networks of computer systems. He has led a number of projects, including Stork.

This talk is part of the Computer Laboratory Systems Research Group Seminar series.