
Sankie: Using Data to Build Better Systems and Services


If you have a question about this talk, please contact Microsoft Research Cambridge Talks Admins.

Please note, this event may be recorded. Microsoft will own the copyright of any recording and reserves the right to distribute it as required.

Today’s systems and services are large and complex, often supporting millions or even billions of entities. Such systems are extremely dynamic, as developers continuously commit code and introduce new features, fixes and, consequently, new bugs. Multiple problems crop up in such a dynamic environment, ranging from misconfiguration of essential services, to very slow testing and deployment procedures, to extended service disruptions when catastrophic bugs reach deployment. Over the last three years, we have been working on Project Sankie, which uses code, test logs and telemetry as data to build several analyses that aid engineers. My talk will describe two of these analyses in detail. First, I will present Rex, a tool that uses commit logs from the last six months to find and flag misconfigurations at commit time, so that they do not enter deployment. Next, I will present Orca, a bug localization tool that has brought about a three-fold reduction in the workload of Office 365’s on-call engineers. Finally, I will briefly summarize the status of Project Sankie and some lessons learned from experience about which strategies worked and which did not.

This talk is part of the Microsoft Research Cambridge, public talks series.

