COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Computer Laboratory Systems Research Group Seminar > Naiad: a system for incremental, iterative and interactive parallel computation
Naiad: a system for incremental, iterative and interactive parallel computationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Eiko Yoneki. We are developing a new system for large-scale data analysis—called “Naiad”—which has the goal of supporting complex iterative queries over dynamic inputs at interactive timescales. Like many existing systems, Naiad supports high-level declarative queries, data-parallel execution, and transparent distribution. Unlike these systems, Naiad can efficiently execute queries with multiple (possibly nested) iterative loops, while simultaneously supporting low-latency incremental changes to the query inputs. As a highlight of its characteristics, Naiad can not only efficiently compute the strongly connected component structure of a 24 hour sliding window of the Twitter @mention graph (using a doubly nested fixed-point computation), but also maintains the computation with sub-second latencies in the face of Twitter’s full volume of continuously arriving tweets. I will describe the computational model underlying Naiad, a generalization of traditional incremental dataflow to partially ordered logical times, and work through some of the (very friendly, picture oriented) mathematical details. I will also highlight several new distributed systems challenges faced in order to fully realize the multiple orders-of-magnitude performance improvements Naiad presents. This is joint work with Derek Murray, Rebecca Isaacs, Michael Isard, and Martìn Abadi. Bio: Frank McSherry is a Senior Researcher at Microsoft’s Silicon Valley Lab, where he focuses on issues related to large-scale data analysis. He has previously worked on machine learning and privacy issues, and is currently hard at work on large-scale low-latency computational infrastructure. This talk is part of the Computer Laboratory Systems Research Group Seminar series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsTCM Journal Club Bioenergy Initiative 10th Annual Sustainable Development Lecture Series 2012 ADF: Amsterdam Density Functional, Concepts and Applications UK~IRC Summit Innovation ForumOther talksMarket Socialism and Community Rating in Health Insurance Positive definite kernels for deterministic and stochastic approximations of (invariant) functions NatHistFest: the 99th Conversazione and exhibition on the wonders of the natural world. Insight into the molecular mechanism of extracellular matrix calcification in the vasculature from NMR spectroscopy and electron microscopy Requirements in Application Development Aspects of adaptive Galerkin FE for stochastic direct and inverse problems Fields of definition of Fukaya categories of Calabi-Yau hypersurfaces Cambridge-Lausanne Workshop 2018 - Day 2 Sneks long balus The Anne McLaren Lecture: CRISPR-Cas Gene Editing: Biology, Technology and Ethics Inferring the Evolutionary History of Cancers: Statistical Methods and Applications TODAY Adrian Seminar - "Functional synaptic architecture of visual cortex" |