- Constitutional Classifiers: Protecting LLMs with Mini Bodyguards
- Introduction to Reinforcement Learning and the Bellman Equations
- How All-Reduce Affects the Backward Pass
- Understanding Megatron-Style Tensor Parallelism
- Understanding the SwiGLU Feed-Forward Layers
- Understanding KL Divergence in Reinforcement Learning