Optimizing Inference Performance of Transformers on CPUs

Optimizing Inference Performance of Transformers on CPUs

Alex Kogan, Dave Dice

26 April 2021

Slides to be presented at the EuroMLSys'21 workshop


Venue : EuroMLSys'21 workshop