Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models (paper page, Mar 1, 2024)