transformer on Sparse Notes

Recent content in transformer on Sparse Notes
https://sparsenotes.com/tags/transformer/

Attention Is All You Need (2017): The Architecture That Ate Machine Learning
https://sparsenotes.com/posts/2026/05/papers/attention-is-all-you-need/
Sat, 30 May 2026 00:00:00 +0000

The 2017 paper by Vaswani et al. that introduced the Transformer, which replaced recurrence and convolution with stacked self-attention and quietly became the substrate for nearly every frontier model of the decade that followed.