Google Cloud AI/ML infrastructure
This folder contains reference guides and blueprints that compile best practices and prescriptive guidelines for running large-scale AI/ML workloads, including large language models and generative AI models, on Google Cloud AI/ML infrastructure.
- TPU Training on GKE. This is a reference guide for executing large-scale training workloads on Cloud TPUs in Google Kubernetes Engine (GKE).