Noisy, sparse, nonlinear: Navigating the Bermuda Triangle of physical inference with deep filtering
- đ¤ Speaker: Carl Poelking
- đ Date & Time: Monday 11 May 2020, 17:00 - 17:30
- đ Venue: virtual ZOOM meeting ID: 263 591 6003, https://zoom.us/j/2635916003
Abstract
Capturing the microscopic interactions that determine molecular reactivity poses a challenge across the physical sciences. Even a basic understanding of the underlying reaction mechanisms can substantially accelerate materials and compound design, including the development of new catalysts or drugs. Given the difficulties routinely faced by both experimental and theoretical investigations that aim to improve our mechanistic understanding of a reaction, recent advances have focused on data-driven routes to derive structure-property relationships directly from high-throughput screens. However, even these high-quality, high-volume data are noisy, sparse and biased – placing them in a regime where machine-learning is extremely challenging. Here we show that a statistical approach based on deep filtering of nonlinear feature networks results in physicochemical models that are more robust, transparent and generalize better than standard machine-learning architectures. Using diligent descriptor design and data post-processing, we exemplify the approach using both literature and fresh data on asymmetric catalytic hydrogenation, Palladium-catalyzed cross-coupling reactions, and drug-drug synergy. We illustrate how the sparse models uncovered by the filtering help us formulate physicochemical reaction “pharmacophores”, investigate experimental bias and derive strategies for mechanism detection and classification.
Series This talk is part of the Machine learning in Physics, Chemistry and Materials discussion group (MLDG) series.
Included in Lists
- Hanchen DaDaDash
- Lennard-Jones Centre external
- Machine learning in Physics, Chemistry and Materials discussion group (MLDG)
- virtual ZOOM meeting ID: 263 591 6003, https://zoom.us/j/2635916003
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Monday 11 May 2020, 17:00-17:30