Bits and Bytes
Shreyas Srivastava
Comparing techniques in LLM application development
LLM inference optimization: Speculative Decoding
Implementing distributed transformer MLP layer in pytorch
Serving latency considerations for embedding recommender systems
Mixed precision training
Voyager LLM paper notes
Deep learning book(Goodfellow) Chapter 8 Optimization
Deep learning book(Goodfellow) Chapter 7 Regularization
Deep learning book(Goodfellow) Chapter 6
Optimizers
Use einops to patchify image
Tech talk notes on "Building Software Systems At Google and Lessons Learned"
Work in progress Notes CIFAR10 resnet exploration