The Operational AI Leap - Zero ETL for Operational AI¶
Overview¶
The Operational AI Leap demonstrates a modern paradigm shift enabling ML engineers to run real-time vector search and generative LLM inference natively inside the database engine. By connecting Google Colab Enterprise directly to live AlloyDB data and joining BigQuery Data Lakes via zero-copy federation, this demo proves how enterprise-grade AI recommendation agents can be deployed in hours with absolutely zero data movement tax.
- Zero-ETL Architecture: Eliminating ETL pipelines by connecting ML environments directly to live operational data
- In-Database Generative AI: Invoking Gemini LLM endpoints directly inside database SQL via secure IAM integration
- Multi-Index Optimization: Fusing Dense Vectors, Sparse Vectors, and Full-Text Search into a single unified plan
- Lakehouse Federation: Executing real-time, zero-copy joins between live databases and BigQuery Data Lakes
- Compute Isolation: Offloading high-throughput AI workloads onto dynamically scaling Read Pools
The demo proves that Zero-ETL workflows accelerate AI deployment cycles from months to hours while protecting primary application performance.
Demo Architectural Flow Diagram¶
flowchart TD
subgraph Client ["Client ML Workspace (Colab Enterprise)"]
ColabNotebook["Colab Enterprise Notebook
(init_connection_pool)"]
end
subgraph AlloyDB ["AlloyDB for PostgreSQL (Operational Database)"]
direction TB
subgraph Extensions ["AI Extension Layer"]
GoogleML["google_ml_integration
(gemini-2.5-flash-lite-global)"]
BigQueryFDW["bigquery_fdw
(Server & User Mappings)"]
end
subgraph Data ["Operational Schema"]
LiveProducts["Products Table
(DML ai.generate)"]
Indices["Multi-Index Optimizations
(ScaNN, HNSW, and GIN)"]
LiveProducts -.->|"Index Mappings"| Indices
end
subgraph QueryEngine ["Query Planner"]
RRF["SQL Reciprocal Rank Fusion
(Outer Joins & Ranking)"]
ReadPools["Scaling Read Pools
(Offloads Analytical Compute)"]
end
LiveProducts --> RRF
BigQueryFDW -->|"Manual Foreign Tables"| RRF
end
subgraph DataCloud ["Google Cloud GenAI & Data Lake"]
VertexAI["Vertex AI API
(Embedding & Generative Completions)"]
BigQueryLake["BigQuery Analytics
(Historical order_items Dataset)"]
end
ColabNotebook -->|"Direct connection pool (0ms lag)"| LiveProducts
GoogleML -->|"Natively Call ai.generate()"| VertexAI
BigQueryFDW -->|"Zero-Copy Federated Join"| BigQueryLake
RRF -->|"The Magic Moment Context Block"| GoogleML
VertexAI -->|"Return Personalized Recommendation Text Card"| ColabNotebook
ReadPools -.->|"Isolates Query Compute"| RRF
style ColabNotebook fill:#4285F4,stroke:#3c4043,color:#FFFFFF
style LiveProducts fill:#34A853,stroke:#3c4043,color:#FFFFFF
style Indices fill:#F8F9FA,stroke:#34A853,color:#202124
style GoogleML fill:#FBBC04,stroke:#3c4043,color:#202124
style BigQueryFDW fill:#FBBC04,stroke:#3c4043,color:#202124
style RRF fill:#4285F4,stroke:#3c4043,color:#FFFFFF
style ReadPools fill:#F8F9FA,stroke:#4285F4,color:#202124
style VertexAI fill:#EA4335,stroke:#3c4043,color:#FFFFFF
style BigQueryLake fill:#34A853,stroke:#3c4043,color:#FFFFFF
Getting Started¶
Prerequisites¶
- Google Cloud Project with billing enabled.
- Google Cloud SDK (gcloud) installed and configured.
- Permissions to enable necessary Google Cloud APIs (e.g., Alloy DB, Gemini Enterprise Agent Platform)
- Access to a Google Cloud environment where you can deploy resources and run Jupyter notebooks (e.g., Colab Enterprise).
Quick Deploy via Terraform¶
- Follow Option 1: Quick Deploy via Terraform section from Cymbal Shops StyleSearch AlloyDB AI Demo's README document.
NOTE: Set
TF_VAR_argolisto true if you are preparing this demo on Argolis Environment.
Configure Colab Enterprise¶
- From Google Cloud Console, search "Colab" from search box and click Colab Enterprise menu.
- Click
Runtime templatemenu item from Colab Enterprise left sidebar. - Open menu by click three dots
Actionsmenu ofDefaultruntime templates chooseCloneoption. - From Create new runtime template page's first step - Runtime basics,
input
Default with demo-vpcas Display name box. - Ignore Configure compute and Environment steps and choose Networking and security page.
- Change the Network to
demo-vpcby clicking the item from menu. - Change the Subnetwork to
demo-vpcby clicking the item from menu. - Click Create button at the bottom to create new runtime template
- Return to Runtime templates page and Open menu by click three dots
Actionsmenu ofDefault with demo-vpcruntime templates chooseCreate runtimeoption. - Click Create button at the bottom to create new runtime.
- Make a note the name of runtime that you just created.
NOTE: This is required because Colab Enterprise runtime need to be deployed in same VPC with Alloy DB Cluster and Instance for private connection.
Import a Jupyter Notebook to Colab Enterprise¶
- From Google Cloud Console, search "Colab" from search box and click Colab Enterprise menu.
- Click Import notebooks button and choose
URLas Import source -
Copy below notebook URL to Notebook URLs input box.
-
Notebook URL:
https://raw.githubusercontent.com/GoogleCloudPlatform/cloud-solutions/refs/heads/main/projects/operational-ai-leap/001-after-quick-deploy.ipynb -
Click Import button at the bottom to create new notebook file.
Change Colab Enterprise runtime¶
- From the notebook page you imported from previous step.
- Click small triangle button
▾at the right top conner before∧button. - Choose
Change runtime typemenu from Additional connection options menu. - From Connect to Agent Platform Runtime page, click Runtimes combo box.
- Choose the Runtime that you created from previous step.
- Check the value of
NetworkandSubnetworkisdemo-vpc - Click Connect button at the bottom to connect to new runtime.
- From now follow the instructions from the notebook you imported.
Special thanks¶
I would like to extend special thanks to Paul Ramsey (paulramsey@) for his excellent Cymbal Shops StyleSearch AlloyDB AI Demo which served as the foundation for this demo.
License¶
Please refer to the LICENSE file for details.
Disclaimer¶
This is NOT an officially supported Google product.
This software is provided "as is", without warranty of any kind, expressed or implied, including but not limited to, the warranties of merchantability, fitness for a particular purpose, and/or infringement.
See LICENSE file for additional details.