Ultra-scale Playbook - ZeRO Sharding

Notes on training LLMs using sharding strategies

June 21, 2025 · 8 min · 1583 words

Ultra-scale Playbook - Data Parallelism

Notes on training LLMs using data parallelism strategy

May 17, 2025 · 5 min · 967 words

Ultra-scale Playbook - Train on a single GPU

Notes on Ultra-scale Playbook - training LLM on a single GPU

April 27, 2025 · 4 min · 803 words

TIL this week

Interesting things I learnt this week

April 25, 2025 · 1 min · 174 words

Hello World 👋

Introduction and motivation

April 20, 2025 · 1 min · 36 words