Blog
Archive
Books
Hiking
Search
Tags
Archive
2025
6
June
2
Choosing a batch size and provider for LLM training
June 27, 2025
· 4 min · 756 words
Ultra-scale Playbook - Deepspeed ZeRO
June 21, 2025
· 8 min · 1519 words
May
1
Ultra-scale Playbook - Data Parallelism
May 17, 2025
· 5 min · 940 words
April
3
Ultra-scale Playbook - Train on a single GPU
April 27, 2025
· 4 min · 797 words
TIL this week
April 25, 2025
· 1 min · 174 words
Hello World 👋
April 20, 2025
· 1 min · 36 words