Infrastructure

Systems for training and serving models at scale.

Topics

  • Hardware (GPU/TPU), scheduling, and packing
  • Serving stacks (vLLM, TGI, custom)
  • Caching, autoscaling, and cost controls
  • Security and multi-tenancy