ROAR: Increasing the Flexibility and Performance of Distributed Search
- đ¤ Speaker: Costin Raiciu (UCL)
- đ Date & Time: Thursday 19 March 2009, 16:00 - 17:00
- đ Venue: FW26, Computer Laboratory, William Gates Builiding
Abstract
To be able to search the web, modern search engines partition the web index over many machines, each of which is consulted when answering a query. To increase query throughput, replicas are added for each of these machines.
The key parameter of web search algorithms is the trade-off between replication and partitioning: while increasing the partitioning level (which reduces the replication level) improves query completion time since more servers handle the query, increasing it indefinitely incurs non-negligible startup costs for each sub-query. Further, the optimal point changes if we factor in costs related to storing and updating the data set, or if the query rate changes. Finding the right operating point and adapting to it is crucial, yet current algorithms assume a fixed trade-off, with reconfiguration being slow and manual.
In this work, we introduce Rendezvous On a Ring (ROAR), a novel distributed algorithm that enables on-the-fly re-configuring of the partitioning level while still servicing queries. In addition, ROAR can add and remove servers without stopping the system, cope with temporary and permanent server failures, and provide very good load-balancing even in the face of servers having heterogeneous hardware capabilities.
To support these claims, we present results from a 43-server testbed deployment of ROAR .
Bio: I am finishing my PhD at UCL on “Making Privacy Preserving Search Practical”, under the supervision of Mark Handley and David Rosenblum. My thesis examines how we can execute encrypted queries on encrypted data, and shows how to paralelize this search to make it applicable in practice. The latter technique is more general and has broader implications in the design of distributed, data-center based, web search algorithms.
I am also working with Mark Handley and Damon Wischik of UCL (along with others in the Trilogy EU project) on designing a multipath transport protocol and an appropriate congestion control algorithm that achieves resource pooling.
Generally, I am interested in distributed systems and networking, as well as security, and I have worked on a few topics in these fields. More info on my webpage at: http://www.cs.ucl.ac.uk/staff/C.Raiciu/
Series This talk is part of the Computer Laboratory Systems Research Group Seminar series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- CL's SRG seminar
- Computer Laboratory Systems Research Group Seminar
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory, William Gates Builiding
- Interested Talks
- ndk22's list
- ob366-ai4er
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Costin Raiciu (UCL)
Thursday 19 March 2009, 16:00-17:00