An empirical study of the effect of sequence alignment on phylogenetic analysis
If you have a question about this talk, please contact Mustapha Amrani.

This talk has been canceled/deleted

Phylogenetic analyses start with a multiple sequence alignment, which is often accepted as known despite wide recognition that errors may impact downstream phylogenetic analysis. Many phylogenetic methods involve testing which of a range of competing hypotheses best describes the evolution of a set of sequences. These tests may be justified statistically when using the correct alignment, but errors in the alignment lead to non-homologous characters being placed together, which in turn may systematically bias the test. We investigate empirically the impact of different alignment methods on phylogenetic analyses and assess the relative impact of the different approximations used by those methods.

We examine the effect of alignment on two phylogenetic analyses that are commonly used in computational biology: the inference of a maximum-likelihood tree using RAxML, and a test for positive selection by comparing the M7 and M8 models in PAML. We test 200 sets of sequences from the Adaptive Evolution Database using the popular aligners ClustalW, Muscle, MAFFT, ProbCons, and the phylogenetic aligner Prank. We also sample from the posterior distribution of the statistical aligner BAli-Phy, which enables us to compare the relative impact of aligner choice to uncertainty from a single aligner.

The algorithmic basis of an aligner tends to determine the outcome of the phylogenetic analysis. For example, trees estimated from progressive aligners tend to be more similar to one another than those estimated from phylogenetically aware (Prank) or consensus (ProbCons) aligners. Moreover, the spread of phylogenetic parameter estimates inferred from BAli-Phy's posterior distribution of alignments is much smaller than the differences between other aligners, suggesting that the differences between aligners are larger than could be expected by chance.
Of the aligners examined, our results suggest that the phylogenetically informed Prank provides the closest approximation to full statistical alignment.

This talk is part of the Isaac Newton Institute Seminar Series.
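
The abstract does not give the details of the positive-selection test, but the M7 versus M8 comparison is usually carried out as a likelihood ratio test with two degrees of freedom. The sketch below illustrates that calculation under this assumption; it is not the speaker's pipeline. The function name `lrt_m7_vs_m8`, the aligner labels, and the log-likelihood values are hypothetical placeholders chosen only to show how the same test can lead to different conclusions when the alignment changes.

```python
import math


def lrt_m7_vs_m8(lnl_m7: float, lnl_m8: float) -> tuple[float, float]:
    """Likelihood ratio test of M7 (null) against M8 (alternative).

    Assumes the conventional chi-square reference distribution with
    2 degrees of freedom, for which the survival function has the
    closed form exp(-x / 2).
    """
    # The statistic is twice the log-likelihood gain of the richer model.
    # Clamp at zero in case optimisation noise makes M8 look slightly worse.
    stat = max(0.0, 2.0 * (lnl_m8 - lnl_m7))
    p_value = math.exp(-stat / 2.0)
    return stat, p_value


if __name__ == "__main__":
    # Hypothetical log-likelihoods (M7, M8) for one gene aligned two ways.
    # These numbers are invented for illustration, not results from the talk.
    fits = {
        "aligner_A": (-2350.0, -2345.0),
        "aligner_B": (-2300.0, -2299.0),
    }
    for name, (lnl_m7, lnl_m8) in fits.items():
        stat, p = lrt_m7_vs_m8(lnl_m7, lnl_m8)
        verdict = "reject M7" if p < 0.05 else "do not reject M7"
        print(f"{name}: 2*dlnL = {stat:.2f}, p = {p:.4f} ({verdict})")
```

With these placeholder values the first alignment would support positive selection at the 5% level and the second would not, which is the kind of alignment-dependent outcome the abstract describes.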