2025  6

June  2

Choosing a batch size and provider for LLM training

June 27, 2025 · 4 min · 756 words

Ultra-scale Playbook - Deepspeed ZeRO

June 21, 2025 · 8 min · 1519 words

May  1

Ultra-scale Playbook - Data Parallelism

May 17, 2025 · 5 min · 940 words

April  3

Ultra-scale Playbook - Train on a single GPU

April 27, 2025 · 4 min · 797 words

TIL this week

April 25, 2025 · 1 min · 174 words

Hello World 👋

April 20, 2025 · 1 min · 36 words