Blog

Archive
Books
Projects
Hiking
Search
Tags

Gke

Where vLLM Cold-Start Time Goes on GKE?

Measuring vLLM cold-start bottlenecks on GKE and evaluating ways to reduce time to first request.

April 11, 2026 · 12 min · 2362 words

© dudeperf3ct · Powered by Hugo & PaperMod