Google Cloud and NVIDIA expand AI innovation across industries at GTC 2026

2026-03-16

11 min read

by Mark Lohmeyer

Tags:

AI & Machine Learning

Partners

Compute


Google Cloud and NVIDIA announced an expanded AI infrastructure partnership at GTC 2026, introducing new hardware, software, and platform innovations.

• G4 VMs powered by NVIDIA RTX Pro 6000 Server Edition support models from 30B to 100B+ parameters using FP4 precision, delivering higher throughput and reduced latency for real-time AI agents
• Fractional G4 VMs (preview) offer 1/2, 1/4, and 1/8 GPU slice sizes via NVIDIA vGPU technology, enabling right-sized and cost-efficient resource allocation
• Google Cloud plans to offer NVIDIA Vera Rubin NVL72 rack-scale systems in H2 2026, integrated into the AI Hypercomputer architecture
• NVIDIA Dynamo integrates with GKE Inference Gateway to provide a modular, open-source inference control plane for MoE and agentic workloads
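
To make the FP4 and fractional-GPU points above concrete, here is a rough sizing sketch in Python. It is not from the announcement. It assumes FP4 stores each weight in 4 bits (0.5 bytes) with no scale-factor overhead, that a full GPU has 96 GB of memory, and that a fractional slice receives a proportional share of that memory. It also ignores KV cache and activations, so real requirements will be higher. The function names are hypothetical.

```python
# Back-of-the-envelope sizing; every constant here is an assumption,
# not a figure taken from the announcement.
BYTES_PER_PARAM_FP4 = 0.5   # 4 bits per weight, ignoring scale-factor overhead
GPU_MEMORY_GB = 96.0        # assumed memory of one full GPU
SLICE_FRACTIONS = (1 / 8, 1 / 4, 1 / 2, 1.0)  # slice sizes named above, plus a full GPU


def weight_memory_gb(params_billions: float,
                     bytes_per_param: float = BYTES_PER_PARAM_FP4) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    # billions of parameters x bytes per parameter = gigabytes
    return params_billions * bytes_per_param


def slice_memory_gb(fraction: float,
                    gpu_memory_gb: float = GPU_MEMORY_GB) -> float:
    """Memory of a fractional slice, assuming a proportional split."""
    return gpu_memory_gb * fraction


def smallest_fitting_slice(params_billions: float,
                           fractions=SLICE_FRACTIONS):
    """Return the smallest GPU fraction whose memory holds the weights, or None."""
    needed = weight_memory_gb(params_billions)
    for fraction in sorted(fractions):
        if slice_memory_gb(fraction) >= needed:
            return fraction
    return None  # weights alone exceed a single GPU under these assumptions
```

Under these assumptions, a 30B-parameter model needs about 15 GB for weights and fits a 1/4 slice (24 GB), while a 100B-parameter model needs about 50 GB and requires a full GPU. A real deployment would also budget for KV cache and runtime overhead before choosing a slice size.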
