Skip to content

Container Selection Plan: azure-blob-poc-selection-us-east

Container Analysis Summary

Container Name Region Total Size Blob Sizes Blob Count File Types Permissions Complexity Notes
finance eastus2 ~41.2 GB Mix (KB to 100MB) 4,285 Backups (.bak), SQL, XML, Docs High (Net Rules) Strongest candidate. Good size mix and region alignment.
engineering eastus ~1.02 GB Mostly Small (KB/MB) 4,525 Dev artifacts (.whl, .parquet, .js) High (Net Rules) Good secondary. Excellent for small-file performance testing.
human-resources centralus N/A Mix 4,150 Docs High Wrong Region.
legal centralus N/A Mix 3,980 Legal Docs High Wrong Region.
marketing canadaeast N/A Mix 3,860 Marketing Assets High Wrong Region.
product canadacentral N/A Mix 3,894 Product Specs High Wrong Region.
sales canadacentral N/A Mix 4,247 Sales Data High Wrong Region.

Recommendations for proof of concept migration

For the Proof of Concept (POC) migration to the US EAST region, the finance container is the optimal selection.

Primary Selection: finance

Reasoning:

  1. Region Alignment: Located in eastus2, which is a primary US East region, minimizing latency and egress costs during the POC.
  2. Data Representative:
    • Size & Scale: With a total size of ~41.2 GB, it provides a substantial dataset to validate throughput.
    • Object Mix: It contains a realistic mix of "larger" files (Database backups ~100MB) and smaller documents (PDFs, XMLs). This variation is critical for testing multi-part upload settings and throughput scaling.
  3. Complexity: The container enforces strict Network Rules (defaultAction: Deny), providing a necessary test case for mapping Azure VNet/IP restrictions to Google Cloud VPC Service Controls or IAM conditions.

Secondary Selection: engineering

Reasoning: The engineering container (eastus) is a strong alternative or add-on. While smaller (~1GB), it contains a high density of small files (code, logs), which is excellent for stress-testing "metadata-heavy" operations and evaluating object creation latency (Requests Per Second) rather than just bandwidth.

Conclusion

Start the POC with finance to validate throughput and large-file handling, optionally adding engineering to test small-file performance.