- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
LLM From Scratch: Building TinyGPT that works
Transformer Components, Variants, Implementation Details
-
Gradient-Based Optimization: Theory, Practice, and Evolution
sgd, momentum, adagrad, adam, and beyond
-
Backpropagation: From Intuition to FLOPs
gradient calculation, flow and FLOPs
-
DL Under the Hood: Tensors, Views, and FLOPs
A deep dive into tensors, their storage, views, and compute in PyTorch.
-
How Computers Store Data in Memory: Brief Intro
fp32, fp16, and more