Where vLLM Cold-Start Time Goes on GKE? Measuring vLLM cold-start bottlenecks on GKE and evaluating ways to reduce time to first request.