Efficient multi-task Gaussian process models for genome-wide association studies

Francesco Paolo Casale, European Bioinformatics Institute
Friday 25 September 2015, 11:00-12:00
Engineering Department, CBL Room BE-438.

If you have a question about this talk, please contact Carl Edward Rasmussen.

Population-level datasets, in which genotype and phenotype data are available for large numbers of samples, have enabled genome-wide association studies (GWAS), both in humans and in a wide range of model organisms. GWAS present several critical analysis challenges that current approaches address only in isolation. Among these are confounding factors, such as population structure, which result in non-IID sample structure. Additionally, for many complex traits, genetic effects can be weak and dispersed across a large number of genetic features. Finally, individual phenotypes can rarely be considered independent; instead, it is important and beneficial to model the correlation structure between them.

In this talk, I will present approaches based on multi-task Gaussian processes to comprehensively address the challenges above. The method enables testing for association between sets of genetic features and multiple (correlated) phenotypes while simultaneously accounting for non-IID sample structure in the data. I will discuss both the modeling aspects and alternative scalable exact and approximate inference schemes for applications to large datasets. Finally, I will present applications to real data with thousands of samples and tens of traits, where we find that our method outperforms established methods in GWAS.
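
The abstract does not spell out the model, so the following is only a rough sketch under stated assumptions, not the speaker's implementation. It assumes one common multi-trait formulation in which the N x P phenotype matrix Y (samples by traits) has covariance Cg ⊗ R + Cn ⊗ I, where R is a sample-relatedness matrix capturing non-IID structure and Cg, Cn are trait-by-trait genetic and noise covariances. All function and variable names are hypothetical. The sketch compares a naive log marginal likelihood, which factorises the full NP x NP covariance, with an exact version that exploits the Kronecker structure through eigendecompositions of the small matrices, which is one way such models can be made scalable.

```python
import numpy as np


def loglik_naive(Y, R, Cg, Cn):
    """Gaussian log marginal likelihood using the full (N*P) x (N*P) covariance."""
    N, P = Y.shape
    # Covariance of the column-stacked phenotypes vec(Y) (assumed model form).
    K = np.kron(Cg, R) + np.kron(Cn, np.eye(N))
    y = Y.reshape(-1, order="F")
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L, y)  # L^{-1} y, so alpha @ alpha = y^T K^{-1} y
    return (-0.5 * alpha @ alpha
            - np.log(np.diag(L)).sum()
            - 0.5 * N * P * np.log(2.0 * np.pi))


def loglik_kron(Y, R, Cg, Cn):
    """Same quantity, computed from N x N and P x P eigendecompositions only."""
    N, P = Y.shape
    sr, Ur = np.linalg.eigh(R)           # R  = Ur diag(sr) Ur^T
    sn, Un = np.linalg.eigh(Cn)          # Cn = Un diag(sn) Un^T
    W = Un / np.sqrt(sn)                 # Un diag(sn^{-1/2}): whitens the noise covariance
    sg, Ug = np.linalg.eigh(W.T @ Cg @ W)
    # After rotating the data, the covariance is diagonal with entries sr_i * sg_p + 1.
    D = np.outer(sr, sg) + 1.0
    Y_rot = Ur.T @ Y @ W @ Ug
    quad = np.sum(Y_rot ** 2 / D)
    logdet = N * np.sum(np.log(sn)) + np.sum(np.log(D))
    return -0.5 * (quad + logdet + N * P * np.log(2.0 * np.pi))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    N, P, S = 30, 4, 100                 # samples, traits, genetic features (toy sizes)
    G = rng.standard_normal((N, S))      # stand-in for standardised genotypes
    R = G @ G.T / S                      # simple linear relatedness matrix
    A = rng.standard_normal((P, P))
    B = rng.standard_normal((P, P))
    Cg = A @ A.T + 0.1 * np.eye(P)       # arbitrary positive-definite trait covariances
    Cn = B @ B.T + 0.1 * np.eye(P)
    Y = rng.standard_normal((N, P))
    print(loglik_naive(Y, R, Cg, Cn), loglik_kron(Y, R, Cg, Cn))
```

The two functions should agree up to floating-point error; the second avoids ever forming the NP x NP matrix, so its cost is governed by the N x N and P x P decompositions rather than by a factorisation of the full covariance.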

This talk is part of the Machine Learning @ CUED series.
