attention 4 FlashAttention Sep 19, 2024 Deja Vu, but Make It Linear: The KV Cache Aug 23, 2024 Attention Is a Third of What You Need: QKV, Dot Products, and the Other Two-Thirds Jul 10, 2024 Transformers: More Than Meets the Eye Jun 13, 2024