Infrastructure¶ Systems for training and serving at scale. Topics¶ Hardware (GPU/TPU), scheduling, and packing Serving stacks (vLLM, TGI, custom) Caching, autoscaling, and cost controls Security and multi-tenancy