Media Summary: local attention in LMs / NMT, routing attention, longformers, linformers slides: ...
Lecture 20 Efficient Transformers Mit - Detailed Analysis & Overview
local attention in LMs / NMT, routing attention, longformers, linformers slides: ...