Serve a model for online inference using a single TPU on GKE
For background on how TPUs work on GKE, see the "TPUs in GKE introduction" documentation.
This directory contains the files for JAX model inference and serving. For step-by-step instructions, see the quickstart guide.