COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Language Technology Lab Seminars > Causal analysis of the syntactic representations of Transformers
Causal analysis of the syntactic representations of TransformersAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Marinela Parovic. The success of artificial neural networks in language processing tasks has underscored the need to understand how they accomplish their behavior, and, in particular, how their internal vector representations support that behavior. The probing paradigm, which has often been invoked to address this question, relies on the (typically implicit) assumption that if a classifier can decode a particular piece of information from the model’s intermediate representation, then that information plays a role in shaping the model’s behavior. This assumption is not necessarily justified. Using the test case of everyone’s favorite syntactic phenomenon – English subject-verb number agreement – I will present an approach that provides much stronger evidence for the causal role of the encoding of a particular linguistic feature in the model’s behavior. This approach, which we refer to as AlterRep, modifies the internal representation in question such that it encodes the opposite value of that feature; e.g., if BERT originally encoded a particular word as occurring inside a relative clause, we modify the representation to encode that it is not inside the relative clause. I will show that the conclusions of this method diverge from those of the probing method. Finally, I will present a method based on causal mediation analysis that makes it possible to draw causal conclusions by applying counterfactual interventions to the inputs, contrasting with AlterRep which intervenes on the model’s internal representations. This talk is part of the Language Technology Lab Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsExperience Islam Week 2011 (12th February - 20th February) Vegetable Love: Edible plants between nature and culture Women in Academia: Skills and PracticesOther talksMarking the Bicentenary of Dostoevsky's Birth – A talk by V. Dimitriev (in Russian): On the Structure of the Idea in Dostoevsky's Works Language based Pre-training for Drug Discovery Week 9 Synchrony and synaptic signaling in the cerebellar circuit |