Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Mechanistic Understanding of Language Models in Arithmetic Reasoning and Code Generation

Add to your list(s) Download to your calendar using vCal

Ziyu Yao, George Mason University
Thursday 05 December 2024, 14:00-15:00
https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09.

If you have a question about this talk, please contact Tiancheng Hu.

Abstract:

Transformer-based language models (LMs) have demonstrated the promise in solving more and more complicated tasks, yet alongside their advancements are growing concerns on their safety and reliability. These concerns primarily stem from our limited understanding of these LMs and the difficulty in interpreting their behaviors. In this talk, I will present our two recent projects towards forming a mechanistic understanding of LMs. In the first project (published at ACL ’24), we explain how Chain-of-Thoughts (CoT) elicit the arithmetic reasoning of LMs by looking into the neuron activation inside the models; in the second project (ongoing), we generalize the analysis to understand the mechanism of how LMs solve the structured code generation problem. Finally, I will conclude the talk by briefly sharing our other effort along the line of LM planning and interpretability.

Bio:

Ziyu Yao (https://ziyuyao.org/) is an Assistant Professor in the Department of Computer Science at George Mason University, where she co-leads the George Mason NLP group (https://nlp.cs.gmu.edu/). Her research topics cover LLMs, semantic parsing/code generation, model interpretability, and human-AI interaction. Her work has been funded by National Science Foundation, Virginia Commonwealth Cyber Initiative, Microsoft’s Accelerating Foundation Models Research Program, among others. She has regularly served as an area chair at top-tier NLP /AI conferences and was the Diversity & Inclusion Co-Chair at NAACL 2024 . Prior to George Mason, she graduated with a Ph.D. degree in Computer Science and Engineering from the Ohio State University in 2021, where she was awarded the prestigious Presidential Fellowship. She was also selected as a rising star in EECS by UC Berkeley in 2021.

This talk is part of the Language Technology Lab Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Mechanistic Understanding of Language Models in Arithmetic Reasoning and Code Generation

This talk is included in these lists:

Other lists

Other talks