COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Genetics Seminar > Beyond the reference genome: inference using prior knowledge of the diversity of a species.
Beyond the reference genome: inference using prior knowledge of the diversity of a species.Add to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Caroline Newnham. Host: Chris Illingworth Encoding the genetic diversity of a species into a ³graph reference², with applications to antibiotic-resistance and antigenic variation The choice of reference object used to represent the genome of a species has far reaching effects on biological inference. The use of a ³reference genome² made pragmatic sense when sequencing was expensive. However it can also lead to under-detection of non-reference alleles and various artefacts. The problem is substantial in regions of genomes of high diversity, such as the human MHC , or where unusual mutational mechanisms occur, as observed in the var genes in P. falciparum, for instance. Indeed, the limitations of a single reference genome have long been recognized in bacterial genomics, where the term pan-genome has been coined to refer to the full set of genes found across all individuals of a species. Recently progress has been made towards using an ensemble of genomes or variation data as a first-class object . However many algorithmic challenges remain and few implementations exist. I will discuss two lines of research from my group, based on the idea of building a reference graph of known sequence and variation in a species. The first relates to the prediction of antibiotic resistance in bacteria, recently published for S. aureus and M. tuberculosis (nature.com/ncomms/2015/151221/ncomms10063/full/ncomms10063.html). The second shows how a new data structure based on a generalised Burrows-Wheeler Transform and FM-index allow encoding of a graph of variation and read-mapping to the graph. I will show the impact of this method on the study of antigenic diversity in P. falciparum, the parasite responsible for the most deadly form of malaria. This talk is part of the Genetics Seminar series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsPhonetics & Phonology Research Cluster Lucy Cavendish College public lecture series Cambiowebinars Seminar Series Feb-Apr 2015 Economic Epidemiology Millennium Maths Project public and schools' events Silicon Valley comes to the UK 2011Other talksSurface meltwater ponding and drainage causes ice-shelf flexure Player 2 has entered the game - ways of working towards open science Towns, Cities and the Tilting of Britain's Political Axis New Insights in Immunopsychiatry (Provisional Title) Making Refuge: Scripture and Refugee Relief |