Ultrascale Playbook - Pipeline Parallelism

Notes on training LLMs using pipeline parallelism

October 25, 2025 · 14 min · 2921 words

Python in practice - Abstraction design

Learnings from reading the python code for semlib library

October 10, 2025 · 3 min · 551 words

Pydata MCR talk on training LLMs

My talk on training LLMs at Pydata MCR

September 25, 2025 · 1 min · 145 words

Pydantic Logfire is awesome 🔥

Observability platform for Python applications

September 20, 2025 · 5 min · 953 words

Distributed communication for GPUs (part 2)

Introduction to collective communication operations used for distributed training.

September 13, 2025 · 13 min · 2568 words