BSU Seminar: “Unsupervised substructure discovery in mass spectrometry metabolomics data”
- 👤 Speaker: Dr. Simon Rogers, University of Glasgow
- 📅 Date & Time: Thursday 08 March 2018, 14:30 - 15:30
- 📍 Venue: Large Seminar Room, 1st Floor, Institute of Public Health, University Forvie Site, Robinson Way, Cambridge
Abstract
Identification of molecules observed in data derived from mass spectrometry experiments remains is very difficult, and hinders the wider application of high throughput untargeted metabolomics. The standard analysis pipelines compare observed spectra with databases of known molecules, but these databases have very low coverage resulting in the majority of the measured molecules being unidentified. Here, I will present an alternative approach for exploring datasets from untargeted metabolomics experiments that uses a technique from text mining (Latent Dirichlet Allocation; LDA ) to decompose the observed spectra across a set of shared components. We show that these often represent molecular substructures. In other words, we are able to break unknown molecules down into building blocks, many of which can be identified. I will show results on a number of standards and real datasets, as well as describe some future directions of this work.
Series This talk is part of the MRC Biostatistics Unit Seminars series.
Included in Lists
This talk is not included in any other list.
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Dr. Simon Rogers, University of Glasgow
Thursday 08 March 2018, 14:30-15:30