Linear Attention for Efficient Transformers
Attention may be all you need, but that doesn’t mean it comes cheap. The Achilles’ heel of the wildly successful Transformer architecture is its time and space complexity, which scale quadratically with the length of the input token sequence. A diverse taxonomy of methods has been proposed to remedy this bottleneck and recover linear complexity, including making attention local, sparse, or low-rank. We will explore the respective strengths and weaknesses of these approaches, discuss theoretical guarantees (or the lack thereof), and consider possible directions for future work.
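To make the low-rank route concrete, here is a minimal NumPy sketch (not part of the talk materials) contrasting standard softmax attention, which forms an N-by-N score matrix, with the kernel-feature-map linear attention of the Katharopoulos et al. paper in the reading list below. The elu+1 feature map follows that paper; the shapes and random test data are illustrative assumptions.

import numpy as np

def softmax_attention(Q, K, V):
    """Standard attention: the N x N score matrix costs O(N^2) time and memory."""
    scores = Q @ K.T / np.sqrt(Q.shape[-1])                   # (N, N)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                                        # (N, d_v)

def linear_attention(Q, K, V, eps=1e-6):
    """Kernel trick: computing phi(Q) (phi(K)^T V) avoids the N x N matrix, giving O(N) cost."""
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))       # elu(x) + 1, keeps features positive
    Qp, Kp = phi(Q), phi(K)                                   # (N, d)
    KV = Kp.T @ V                                             # (d, d_v), size independent of N
    Z = Qp @ Kp.sum(axis=0, keepdims=True).T + eps            # (N, 1) normaliser
    return (Qp @ KV) / Z                                      # (N, d_v)

rng = np.random.default_rng(0)
N, d = 512, 64
Q, K, V = (rng.standard_normal((N, d)) for _ in range(3))
print(softmax_attention(Q, K, V).shape, linear_attention(Q, K, V).shape)  # (512, 64) (512, 64)

Both functions return an (N, d_v) output, but only the first ever materialises an N-by-N matrix; the associativity of matrix multiplication is what buys the linear scaling.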
Suggested reading:
- Attention is all you need (https://arxiv.org/abs/1706.03762). Seminal Transformers paper.
- Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention (https://arxiv.org/abs/2006.16236). Among the first papers on low-rank attention.
- Swin Transformer: Hierarchical Vision Transformer using Shifted Windows (https://arxiv.org/abs/2103.14030). Popular example of local attention.
- Big Bird: Transformers for Longer Sequences (https://arxiv.org/abs/2007.14062). Example of the benefits of using a combination of techniques.
This talk is part of the Machine Learning Reading Group @ CUED series.