Check out our new blog, Deeper Learning. In our newest post, “Repeat After Me: Transformers are Better than State Space Models at Copying,” authors Samy Jelassi, David Brandfonbrener, Sham Kakade and Eran Malach show that the improved efficiency of State Space Models comes at the cost of some capabilities that are core to modern LLMs.

