Skip to content

Index

Serve (online inference) a model using a single TPU and GKE

To better understand how TPUs work on GKE, please read the doc TPUs in GKE introduction.

This directory contains files for JAX Model inference and serving. You can find step-by-step instructions in the quickstart guide.