JupyterHub Profiles

Default Profiles

By default, there are 3 pre-set profiles for JupyterHub:

Profiles Page

As the description for each profiles explains, each profiles uses a different resource.

First profile uses CPUs and uses the image: jupyter/tensorflow-notebook with tag python-3.10
Second profile uses TPUs and will only work when TPU is enabled on the cluster. It uses the image jupyter/tensorflow-notebook with tag python-3.10
Second profile uses 2 T4 GPUs and the image: jupyter/tensorflow-notebook:python-3.10 with tag python-3.10 [^1]
Third profile uses 2 A100 GPUs and the image: jupyter/tensorflow-notebook:python-3.10 with tag python-3.10 [^1]

Editting profiles

You can change the image used by these profiles, change the resources, and add specific hooks to the profiles

Within the config.yaml, the profiles sit under the singleuser key:

singleuser:
    cpu:
        ...
    memory:
        ...
    image:
        name: jupyter/tensorflow-notebook
        tag: python-3.10
    ...
    profileList:
    ...

Image

The default image used by all three of the profiles is jupyter/tensorflow-notebook:python-3.10

For profile 1, it uses the: default: true field. This means that all the default configs under singleuser are used.
For profile 2 and 3, the images are defined under kubespawner_override

    display_name: "Profile2 name"
        description: "description here"
        kubespawner_override:
            image: jupyter/tensorflow-notebook:python-3.10
    ...

Kubespanwer_override is a dictionary with overrides that gets applied through the Kubespawner. [^2]

More images of tensorflow can be found here

Resources

Each of the users get a part of the memory and CPU and the resources are by default:

    cpu:
        limit: .5
        guarantee: .5
    memory:
        limit: 1G
        guarantee: 1G

The limit if the reousrce sets a hard limit on how much of that resource can the user have. The guarantee meaning the least amount of resource that will be available to the user at all times.

Similar to overriding images, the resources can also be overwritten by using kubespawner_override:

    kubespawner_override:
        cpu_limit: .7
        cpu_guarantee: .7
        mem_limit: 2G
        mem_guarantee: 2G
        nvidia.com/gpu: "2"

Node/GPU

JupyterHub config allows the use of nodeSelector. This is the way the profiles specify which node/GPU it wants

nodeSelector:
    iam.gke.io/gke-metadata-server-enabled: "true"
    cloud.google.com/gke-accelerator: "nvidia-tesla-t4"

Override using kubespwaner_override:

    kubespawner_override:
        node_selector:
          cloud.google.com/gke-accelerator: "nvidia-tesla-a100"

The possible GPUs are:

nvidia-tesla-k80
nvidia-tesla-p100
nvidia-tesla-p4
nvidia-tesla-v100
nvidia-tesla-t4
nvidia-tesla-a100
nvidia-a100-80gb
nvidia-l4

TPUs

JupyterHub profiles has a TPU option. It utilizes the nodeSelector, and the annotations cloud.google.com/gke-tpu-accelerator and cloud.google.com/gke-tpu-topology to select the TPU nodes.

We add the following annotations to kupespawner_override

extra_resource_limits:
    google.com/tpu: "4"
node_selector:
    cloud.google.com/gke-tpu-accelerator: "tpu-v4-podslice"
    cloud.google.com/gke-tpu-topology: "2x2x1"

Currently the template only provisions a ct4p-hightpu-4t TPU machine with placement 2x2x1.

Example profile

Example of a profile that overrides the default values:

  - display_name: "Learning Data Science"
    description: "Datascience Environment with Sample Notebooks"
    kubespawner_override:
        cpu_limit: .5
        cpu_guarantee: .5
        mem_limit: 1G
        mem_guarantee: 1G
    image: jupyter/datascience-notebook:2343e33dec46
    lifecycle_hooks:
        postStart:
        exec:
            command:
            - "sh"
            - "-c"
            - >
                gitpuller https://github.com/data-8/materials-fa17 master materials-fa;

Additional Overrides

With kubespanwer_override there are additional overrides that could be done, including lifecycle_hooks, storage_capacity, and storage class Fields can be found here

[^1]: If using Standard clusters, the cluster must have at least 2 of the GPU type ready [^2]: More information on Kubespawner here