gpu

Browse all articles, tutorials, and guides about GPU

2 posts

Posts

2026-06-24|7 min read

Kubernetes 1.37 Just Locked Its Feature Set: What Made the Cut

The enhancements freeze for Kubernetes 1.37 landed on June 17, so the shape of the August release is now decided. GPU partitioning keeps maturing for AI workloads, and a cgroup v1 change will stop some kubelets from starting. Here is what is locked in and what to check before you upgrade.

Kubernetes

2026-05-26|11 min read

How NetEase Games Cut LLM Cold Starts From 42 Minutes to 30 Seconds Using Fluid

NetEase Games published a Kubernetes case study on how it cut serverless GPU inference cold-start time from 42 minutes to under 30 seconds. The bottleneck is not the GPU; it is the 60 GB of model weights being transferred across regions. Here is what they did with the CNCF Fluid project and how to apply the same pattern even if you are not on Kubernetes.