Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Using evolutionary sequence variation to build predictive models of protein structure and function.

Add to your list(s) Download to your calendar using vCal

Lucy Colwell (University of Cambridge)
Wednesday 12 October 2016, 14:00-15:00
MR4, Centre for Mathematical Sciences, Wilberforce Road, Cambridge.

If you have a question about this talk, please contact Emily Boyd.

The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. The explosive growth in the number of available protein sequences raises the possibility of using the natural variation present in homologous protein sequences to infer these constraints and thus identify residues that control different protein phenotypes. Because in many cases phenotypic changes are controlled by more than one amino acid, the mutations that separate one phenotype from another may not be independent, requiring us to understand the correlation structure of the data.

The challenge is to distinguish true interactions from the noisy and under-sampled set of observed correlations in a large multiple sequence alignment. We show that maximum entropy models of the protein sequence, constrained by the statistics of the multiple sequence alignment, are capable of predicting key aspects of protein function. These include (i) the inference of residue pair interactions that are accurate enough to predict all atom 3D structural models; (ii) accurate predictions of binding partners between different proteins; (iii) accurate prediction of binding between protein receptors and their target ligands. We will discuss how a mathematical framework based on random matrix theory bounds which sequence alignments contain sufficient information to build accurate predictive models. Finally, we will pose questions about the physics of binding interactions in an example from the immune system where large sets of evolutionarily related sequences are not available.

This talk is part of the Computational and Systems Biology series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Using evolutionary sequence variation to build predictive models of protein structure and function.

This talk is included in these lists:

Other lists

Other talks