Ultra-scale Playbook - Pipeline Parallelism

Notes on training LLMs using pipeline parallelism

October 25, 2025 · 14 min · 2918 words

PyData MCR talk on training LLMs

My talk on training LLMs at PyData MCR

September 25, 2025 · 1 min · 145 words

Choosing a batch size and provider for LLM training

Notes on choosing an appropriate batch size and compute for training LLMs

June 27, 2025 · 4 min · 751 words

Ultra-scale Playbook - ZeRO Sharding

Notes on training LLMs using sharding strategies

June 21, 2025 · 8 min · 1557 words

Ultra-scale Playbook - Data Parallelism

Notes on training LLMs using a data parallelism strategy

May 17, 2025 · 5 min · 945 words