Skip to content

Google Cloud AI/ML infrastructure

This folder contains reference guides and blueprints that compile best practices, and prescriptive guidelines for running large-scale AI/ML workloads, including Large Language and Generative AI models, on Google Cloud AI/ML infrastructure.

  • TPU Training on GKE. This is a reference guide for executing large-scale training workloads on Cloud TPUs in Google Kubernetes Engine (GKE).