
New Advances in Multimodal Reasoning


If you have a question about this talk, please contact Shun Shao.

Abstract: Today’s language models are increasingly capable of multi-step reasoning, with verification and backtracking, to solve challenging problems. However, multimodal reasoning models that can reason over an integrated set of modalities, such as text, images, audio, video, and knowledge graphs, remain sorely lacking, yet they could pave the way for the next frontier of AI. I will describe our group’s work on advancing the frontiers of multimodal reasoning, from new multimodal reasoning benchmarks to training multimodal foundation models with modern reasoning approaches, along with applications to social understanding and education.

Bio: Paul Liang is an Assistant Professor at the MIT Media Lab and MIT EECS. His research advances the foundations of multisensory artificial intelligence to enhance the human experience. He is a recipient of the Siebel Scholars Award, the Waibel Presidential Fellowship, the Facebook PhD Fellowship, the Center for ML and Health Fellowship, Rising Stars in Data Science, and three best paper awards. Outside of research, he received the Alan J. Perlis Graduate Student Teaching Award for developing new courses on multimodal machine learning.

This talk is part of the Language Technology Lab Seminars series.
