cs336
an archive of posts in this category
| Date | Post |
|---|---|
| Jan 31, 2026 | FlashAttention-2 in Triton: From GPU Mental Models to Kernel Performance |
| Jul 24, 2025 | Backpropagation: From Intuition to FLOPs |
| Jul 14, 2025 | DL Under the Hood: Tensors, Views, and FLOPs |
| Jul 01, 2025 | How Computers Store Data in Memory: Brief Intro |
| Jun 16, 2025 | Tokenization: Background and BPE Explanation |