GPU Partitioning for Windows Server 2025 and Azure Stack HCI Explained

New GPU partitioning technology lets you split one GPU between many virtual machines. This is more efficient than the old way of using one GPU for one machine.

The capacity to divvy up a single physical graphics processing unit (GPU) among multiple virtual machines (VMs) has solidified as a functional technology, enabling a more granular approach to resource allocation. This 'GPU partitioning' (GPU-P) strategy, primarily driven by NVIDIA's vGPU software, allows a single GPU to be shared, offering a fraction of its power to each VM rather than dedicating the entire unit to one.

This capability is not a hypothetical future but a deployed reality. For users of Microsoft Azure Stack HCI and Windows Server 2025, the architecture for GPU partitioning is laid out in operational guides. These systems require specific host and guest drivers, with NVIDIA's Virtual GPU Manager and corresponding graphics drivers being central to the process. Dell Technologies, a prominent player in hardware solutions, specifically points to its servers' need for separate GPU-P drivers and advocates for Windows Admin Center (WAC) as a streamlined tool for managing these configurations. NVIDIA's licensing portal is identified as the source for these necessary drivers.

Read More: NVIDIA releases SANA-WM video model on May 17 2026 for faster creation

GPU partitioning | E2E Deployment and Operations Guide with Scalable Networking - 1

The implementation of GPU partitioning extends across various platforms. On Hyper-V, the virtualization technology underpinning Windows Server, partitioning can be managed through Windows Admin Center or PowerShell. Projects like 'Easy-GPU-PV' on GitHub are actively seeking to simplify this process on Windows Hyper-V environments, automating VM creation and driver installations alongside GPU partitioning. The ability to assign only a single GPU partition to any given VM is a consistent limitation across these systems.

Divergent Paths: GPU-P versus Discrete Device Assignment

While GPU partitioning enables one-to-many GPU sharing, Discrete Device Assignment (DDA) represents a contrasting approach, assigning a whole GPU to a single VM. Both technologies are relevant within Azure Stack HCI environments, but they cater to different needs and have distinct operational requirements.

Read More: System Freezes Cause "Not Responding" Errors on PCs

GPU partitioning | E2E Deployment and Operations Guide with Scalable Networking - 2
FeatureGPU Partitioning (GPU-P)Discrete Device Assignment (DDA)
GPU Resource ModelHigh (one GPU to many VMs)Low (one GPU to one VM)
Driver RequirementGPU vendor driver (e.g., NVIDIA) in guestGPU driver in guest
CompatibilitySupported by NVIDIA vGPU softwareBroad hardware support

Crucially, a physical GPU cannot simultaneously function as both a DDA-assigned device and a partitionable GPU. This segregation ensures that the intended operational mode of the GPU is maintained. For users engaging with GPU partitioning, ensuring compatible CPUs, operating systems, and GPUs is paramount, particularly if live migration of VMs is a desired feature. The Azure Local catalog lists supported solutions and GPU models for those adopting these advanced configurations.

Licensing and Driver Nuances

The underlying technology for GPU partitioning, particularly NVIDIA's vGPU, is tied to a specific licensing model. Separate licensing from NVIDIA is a mandatory requirement for utilizing GPU-P features. The installation process for these drivers involves distinct steps for both the host machines and the individual virtual machines. Within Azure Stack HCI, the integration of NVIDIA's vGPU manager and graphics drivers is a critical step in enabling these capabilities. When managing these partitions, information such as the partition ID, VM ID, VRAM allocation, and encode/decode capabilities becomes visible.

Read More: YouTube AI Likeness Detection Tool Now Open to All Creators 18 Plus

Frequently Asked Questions

Q: How does GPU partitioning help businesses save money on server hardware?
GPU partitioning allows one physical GPU to be split among several virtual machines. This means companies do not need to buy a separate expensive GPU for every single virtual machine they run.
Q: What software is needed to set up GPU partitioning on Windows Server 2025?
You need the NVIDIA Virtual GPU Manager and specific graphics drivers from the NVIDIA licensing portal. Windows Admin Center is also recommended to help manage these configurations easily.
Q: What is the main difference between GPU partitioning and Discrete Device Assignment?
GPU partitioning (GPU-P) allows one GPU to be shared by many virtual machines. Discrete Device Assignment (DDA) gives one whole GPU to only one virtual machine.
Q: Can a single physical GPU be used for both partitioning and DDA at the same time?
No, a physical GPU cannot be used for both methods at once. You must choose one mode for the device before you start your setup.