2025  13

October  2

Ultra-scale Playbook - Pipeline Parallelism

October 25, 2025 · 14 min · 2918 words

Python in practice

October 10, 2025 · 3 min · 551 words

September  4

PyData MCR talk on training LLMs

September 25, 2025 · 1 min · 145 words

Pydantic Logfire is awesome 🔥

September 20, 2025 · 5 min · 952 words

Distributed communication for GPUs (part 2)

September 13, 2025 · 13 min · 2567 words

Distributed communication for GPUs (part 1)

September 9, 2025 · 11 min · 2146 words

August  1

Authenticating AWS with EKS

August 16, 2025 · 3 min · 584 words

June  2

Choosing a batch size and provider for LLM training

June 27, 2025 · 4 min · 751 words

Ultra-scale Playbook - ZeRO Sharding

June 21, 2025 · 8 min · 1557 words

May  1

Ultra-scale Playbook - Data Parallelism

May 17, 2025 · 5 min · 945 words

April  3

Ultra-scale Playbook - Train on a single GPU

April 27, 2025 · 4 min · 801 words

TIL this week

April 25, 2025 · 1 min · 174 words

Hello World 👋

April 20, 2025 · 1 min · 36 words