Mar 1, 2024. Paper page: Simple linear attention language models balance the recall-throughput tradeoff