Training for Deployment: Methods for Small and Efficient NLP

Alexander Rush, Cornell Tech
Thursday 20 May 2021, 15:00-16:00
https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09.

If you have a question about this talk, please contact Marinela Parovic.

Natural language models for translation and classification work relatively well, or at least well enough that there is demand for widespread use in real systems. Models developed for research, however, do not naturally translate to deployment scenarios, particularly on resource-constrained devices like mobile phones. In this talk I will discuss two axes that make it difficult to deploy NLP models in practice: (a) serial generation in translation models makes them difficult to optimize, and (b) fine-tuned parameter size in classification makes models difficult to deploy to end users. I propose two approaches that aim to circumvent these issues, and discuss some practical work on deploying large NLP models on edge devices.

This talk is part of the Language Technology Lab Seminars series.
