How to Compress Your BERT NLP Models For Very Efficient Inference


By Neural Magic

This video covers state-of-the-art compression research that addresses common Transformer drawbacks, including their large size and …
