AI Accelerators 101

If you have a question about this talk, please contact Tobias Grosser.

LLMs are arguably among the largest technology investments since the moon landing, and they rely on custom hardware accelerators for both training and inference. The talk will cover accelerating LLM transformer architectures using the combination of a compiler and a systolic compute array. The key enabler for achieving meaningful performance with the systolic compute array is deep program analysis of the model architecture in the Neuron Compiler. I will briefly report on our effort to build a verified (using Lean) compiler from XLA/HLO to the Trainium ISA.

Daniel Kroening is a Senior Principal Applied Scientist at Amazon, where he works on the correctness of the Neuron Compiler for distributed training and inference. Prior to joining Amazon, he was a Professor of Computer Science at the University of Oxford. He is the co-founder of Diffblue Ltd., a University spinout that develops AI that targets code and code-like artefacts. He wrote the CBMC (for C), JBMC (for Java) and EBMC (for SystemVerilog) model checkers; CBMC is the engine of Kani (for verifying unsafe Rust). He has received the Semiconductor Research Corporation (SRC) Inventor Recognition Award, an IBM Faculty Award, a Microsoft Research SEIF Award, the Wolfson Research Merit Award, and the Rance Cleaveland Test-of-Time tool award. He serves on the CAV steering committee, was co-chair of FLOC 2018 and EiC of Springer FMSD, and is co-author of the textbooks on Decision Procedures and Model Checking.

We will have pizza! Please register so that we can order the right amount of food:

https://docs.google.com/forms/d/1RlUWlAroGtv-cePwe2lL118KbQFzTX647tn4zcmCbuA/viewform?edit_requested=true

This talk is part of the tcg40's list series.
