Skip to content

The Operational AI Leap - Zero ETL for Operational AI

Overview

The Operational AI Leap demonstrates a modern paradigm shift enabling ML engineers to run real-time vector search and generative LLM inference natively inside the database engine. By connecting Google Colab Enterprise directly to live AlloyDB data and joining BigQuery Data Lakes via zero-copy federation, this demo proves how enterprise-grade AI recommendation agents can be deployed in hours with absolutely zero data movement tax.

  • Zero-ETL Architecture: Eliminating ETL pipelines by connecting ML environments directly to live operational data
  • In-Database Generative AI: Invoking Gemini LLM endpoints directly inside database SQL via secure IAM integration
  • Multi-Index Optimization: Fusing Dense Vectors, Sparse Vectors, and Full-Text Search into a single unified plan
  • Lakehouse Federation: Executing real-time, zero-copy joins between live databases and BigQuery Data Lakes
  • Compute Isolation: Offloading high-throughput AI workloads onto dynamically scaling Read Pools

The demo proves that Zero-ETL workflows accelerate AI deployment cycles from months to hours while protecting primary application performance.

Demo Architectural Flow Diagram

flowchart TD
    subgraph Client ["Client ML Workspace (Colab Enterprise)"]
        ColabNotebook["Colab Enterprise Notebook
(init_connection_pool)"] end subgraph AlloyDB ["AlloyDB for PostgreSQL (Operational Database)"] direction TB subgraph Extensions ["AI Extension Layer"] GoogleML["google_ml_integration
(gemini-2.5-flash-lite-global)"] BigQueryFDW["bigquery_fdw
(Server & User Mappings)"] end subgraph Data ["Operational Schema"] LiveProducts["Products Table
(DML ai.generate)"] Indices["Multi-Index Optimizations
(ScaNN, HNSW, and GIN)"] LiveProducts -.->|"Index Mappings"| Indices end subgraph QueryEngine ["Query Planner"] RRF["SQL Reciprocal Rank Fusion
(Outer Joins & Ranking)"] ReadPools["Scaling Read Pools
(Offloads Analytical Compute)"] end LiveProducts --> RRF BigQueryFDW -->|"Manual Foreign Tables"| RRF end subgraph DataCloud ["Google Cloud GenAI & Data Lake"] VertexAI["Vertex AI API
(Embedding & Generative Completions)"] BigQueryLake["BigQuery Analytics
(Historical order_items Dataset)"] end ColabNotebook -->|"Direct connection pool (0ms lag)"| LiveProducts GoogleML -->|"Natively Call ai.generate()"| VertexAI BigQueryFDW -->|"Zero-Copy Federated Join"| BigQueryLake RRF -->|"The Magic Moment Context Block"| GoogleML VertexAI -->|"Return Personalized Recommendation Text Card"| ColabNotebook ReadPools -.->|"Isolates Query Compute"| RRF style ColabNotebook fill:#4285F4,stroke:#3c4043,color:#FFFFFF style LiveProducts fill:#34A853,stroke:#3c4043,color:#FFFFFF style Indices fill:#F8F9FA,stroke:#34A853,color:#202124 style GoogleML fill:#FBBC04,stroke:#3c4043,color:#202124 style BigQueryFDW fill:#FBBC04,stroke:#3c4043,color:#202124 style RRF fill:#4285F4,stroke:#3c4043,color:#FFFFFF style ReadPools fill:#F8F9FA,stroke:#4285F4,color:#202124 style VertexAI fill:#EA4335,stroke:#3c4043,color:#FFFFFF style BigQueryLake fill:#34A853,stroke:#3c4043,color:#FFFFFF

Getting Started

Prerequisites

  • Google Cloud Project with billing enabled.
  • Google Cloud SDK (gcloud) installed and configured.
  • Permissions to enable necessary Google Cloud APIs (e.g., Alloy DB, Gemini Enterprise Agent Platform)
  • Access to a Google Cloud environment where you can deploy resources and run Jupyter notebooks (e.g., Colab Enterprise).

Quick Deploy via Terraform

  1. Follow Option 1: Quick Deploy via Terraform section from Cymbal Shops StyleSearch AlloyDB AI Demo's README document.

NOTE: Set TF_VAR_argolis to true if you are preparing this demo on Argolis Environment.

Configure Colab Enterprise

  1. From Google Cloud Console, search "Colab" from search box and click Colab Enterprise menu.
  2. Click Runtime template menu item from Colab Enterprise left sidebar.
  3. Open menu by click three dots Actions menu of Default runtime templates choose Clone option.
  4. From Create new runtime template page's first step - Runtime basics, input Default with demo-vpc as Display name box.
  5. Ignore Configure compute and Environment steps and choose Networking and security page.
  6. Change the Network to demo-vpc by clicking the item from menu.
  7. Change the Subnetwork to demo-vpc by clicking the item from menu.
  8. Click Create button at the bottom to create new runtime template
  9. Return to Runtime templates page and Open menu by click three dots Actions menu of Default with demo-vpc runtime templates choose Create runtime option.
  10. Click Create button at the bottom to create new runtime.
  11. Make a note the name of runtime that you just created.

NOTE: This is required because Colab Enterprise runtime need to be deployed in same VPC with Alloy DB Cluster and Instance for private connection.

Import a Jupyter Notebook to Colab Enterprise

  1. From Google Cloud Console, search "Colab" from search box and click Colab Enterprise menu.
  2. Click Import notebooks button and choose URL as Import source
  3. Copy below notebook URL to Notebook URLs input box.

  4. Notebook URL: https://raw.githubusercontent.com/GoogleCloudPlatform/cloud-solutions/refs/heads/main/projects/operational-ai-leap/001-after-quick-deploy.ipynb

  5. Click Import button at the bottom to create new notebook file.

Change Colab Enterprise runtime

  1. From the notebook page you imported from previous step.
  2. Click small triangle button at the right top conner before button.
  3. Choose Change runtime type menu from Additional connection options menu.
  4. From Connect to Agent Platform Runtime page, click Runtimes combo box.
  5. Choose the Runtime that you created from previous step.
  6. Check the value of Network and Subnetwork is demo-vpc
  7. Click Connect button at the bottom to connect to new runtime.
  8. From now follow the instructions from the notebook you imported.

Special thanks

I would like to extend special thanks to Paul Ramsey (paulramsey@) for his excellent Cymbal Shops StyleSearch AlloyDB AI Demo which served as the foundation for this demo.

License

Please refer to the LICENSE file for details.

Disclaimer

This is NOT an officially supported Google product.

This software is provided "as is", without warranty of any kind, expressed or implied, including but not limited to, the warranties of merchantability, fitness for a particular purpose, and/or infringement.

See LICENSE file for additional details.