Receive daily AI-curated summaries of engineering articles from top tech companies worldwide.
Endigest AI Core Summary
This post introduces Dynamic Resource Allocation (DRA), a new Kubernetes-native framework for managing hardware accelerators like GPUs and TPUs at scale.
• DRA reached stable status in Kubernetes 1.34, replacing the Device Plugin framework, which could only express hardware needs as simple integer counts
• The ResourceSlice API lets drivers publish granular hardware details (memory, cores, architecture, NUMA topology) to the cluster
• The ResourceClaim API allows users to specify precise requirements such as "any GPU with at least 40 GB of VRAM" or inter-device PCIe constraints
• DRA eliminates manual node pinning by making the kube-scheduler natively aware of hardware attributes and topology
• NVIDIA donated its DRA driver for GPUs and Google donated its DRA driver for TPUs to the Kubernetes community; DRA is now GA in GKE
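The claim-based flow described above can be sketched as a pair of manifests. This is a hedged illustration, not taken from the article: it assumes the `resource.k8s.io/v1` API shape and uses placeholder NVIDIA-style names (the `gpu.nvidia.com` device class and the `nvidia.com` capacity domain), which may differ by driver and version.

```yaml
# Hypothetical sketch: request "any GPU with at least 40 GiB of memory".
# Device class and capacity domain names are assumptions; check your driver's docs.
apiVersion: resource.k8s.io/v1
kind: ResourceClaim
metadata:
  name: large-gpu-claim
spec:
  devices:
    requests:
    - name: gpu
      exactly:
        deviceClassName: gpu.nvidia.com   # assumed class published by the GPU DRA driver
        selectors:
        - cel:
            # CEL expression evaluated against device data the driver
            # publishes in its ResourceSlices
            expression: device.capacity['nvidia.com'].memory.compareTo(quantity('40Gi')) >= 0
---
# The Pod references the claim by name; the scheduler places it on a node
# whose ResourceSlice advertises a matching device.
apiVersion: v1
kind: Pod
metadata:
  name: gpu-pod
spec:
  resourceClaims:
  - name: gpu
    resourceClaimName: large-gpu-claim
  containers:
  - name: app
    image: cuda-app:latest   # placeholder image
    resources:
      claims:
      - name: gpu
```

The point of the split is that the claim expresses *what* hardware is needed, while the driver's ResourceSlices express *what exists*, so the scheduler can match them without any manual node pinning.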
This summary was automatically generated by AI based on the original article and may not be fully accurate.