Emergence of Linear Representations in LMs (NYU)
- 👤 Speaker: Dr. Shauli Ravfogel (NYU)
- 📅 Date & Time: Tuesday 28 October 2025, 11:00 - 12:00
- 📍 Venue: GR03, English Faculty Building, 9 West Road, Sidgwick Site and online https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract
Recent work suggests that language models (LMs) encode many human-interpretable concepts as approximately linear directions in representation space. I first survey evidence for this “linear concept” hypothesis and show how it motivates steering methods—targeted interventions that causally modify model behavior. I then focus on truthfulness, demonstrating that LMs allocate a direction separating true from false assertions. Using an analytically tractable toy transformer, I present a plausible mechanism for how such linear structure emerges and how models exploit it to solve a factuality-related task. Taken together, these results bring us closer to understanding why “simple” geometry arises in LM representations.
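For readers unfamiliar with the idea of steering along a concept direction, the toy sketch below illustrates the general recipe in a hedged way: estimate a direction from labelled activations (here, a simple difference of class means) and add a scaled copy of it to a hidden state. The dimensionality, data, variable names, and the difference-of-means estimator are illustrative assumptions for this sketch, not details of the speaker's method.

```python
# Illustrative sketch of linear-direction steering (not the speaker's method).
# Assumes we already have hidden-state vectors for true/false statements.
import numpy as np

rng = np.random.default_rng(0)
d = 64                                   # toy hidden-state dimensionality

# Toy activations: pretend true and false statements differ along one axis.
axis = np.eye(d)[0]
h_true = rng.normal(size=(100, d)) + axis
h_false = rng.normal(size=(100, d)) - axis

# Estimate a "truth direction" as the difference of class means
# (one common choice; probing classifiers are another).
direction = h_true.mean(axis=0) - h_false.mean(axis=0)
direction /= np.linalg.norm(direction)

def steer(hidden_state: np.ndarray, alpha: float = 2.0) -> np.ndarray:
    """Add a scaled copy of the concept direction to a hidden state."""
    return hidden_state + alpha * direction

# Projection onto the direction before and after steering a "false"-like vector.
h = h_false[0]
print("before:", float(h @ direction))
print("after: ", float(steer(h) @ direction))
```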
Bio: Dr Shauli Ravfogel is a Postdoctoral Researcher and Faculty Fellow at the NYU Center for Data Science. He earned his PhD from the Natural Language Processing Lab at Bar-Ilan University, supervised by Prof. Yoav Goldberg. His research focuses on analyzing and controlling the internal representations of generative models, particularly language models. He studies how neural networks encode structured information, use it to solve tasks, and represent interpretable concepts. He aims—sometimes even successfully—to develop mathematically principled approaches to interpretability. He is particularly interested in understanding how simple structures, such as concept-aligned linear subspaces, emerge as a byproduct of the language modeling objective, and how such structures can be used to steer and control models. During his PhD, he worked on techniques to selectively control information in neural representations, with some fun linguistic side tours. More recently, he has explored framing language models as causal models and tackling questions of learnability in a controlled setting.
Series
This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- GR03, English Faculty Building, 9 West Road, Sidgwick Site and online https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.