GPU Partitioning for Windows Server 2025 and Azure Stack HCI Explained

New GPU partitioning technology lets you split one GPU between many virtual machines. This is more efficient than the old way of using one GPU for one machine.

The capacity to divvy up a single physical graphics processing unit (GPU) among multiple virtual machines (VMs) has solidified as a functional technology, enabling a more granular approach to resource allocation. This 'GPU partitioning' (GPU-P) strategy, primarily driven by NVIDIA's vGPU software, allows a single GPU to be shared, offering a fraction of its power to each VM rather than dedicating the entire unit to one.

This capability is not a hypothetical future but a deployed reality. For users of Microsoft Azure Stack HCI and Windows Server 2025, the architecture for GPU partitioning is laid out in operational guides. These systems require specific host and guest drivers, with NVIDIA's Virtual GPU Manager and corresponding graphics drivers being central to the process. Dell Technologies, a prominent player in hardware solutions, specifically points to its servers' need for separate GPU-P drivers and advocates for Windows Admin Center (WAC) as a streamlined tool for managing these configurations. NVIDIA's licensing portal is identified as the source for these necessary drivers.

GPU partitioning | E2E Deployment and Operations Guide with Scalable Networking - 1

The implementation of GPU partitioning extends across various platforms. On Hyper-V, the virtualization technology underpinning Windows Server, partitioning can be managed through Windows Admin Center or PowerShell. Projects like 'Easy-GPU-PV' on GitHub are actively seeking to simplify this process on Windows Hyper-V environments, automating VM creation and driver installations alongside GPU partitioning. The ability to assign only a single GPU partition to any given VM is a consistent limitation across these systems.

Divergent Paths: GPU-P versus Discrete Device Assignment

While GPU partitioning enables one-to-many GPU sharing, Discrete Device Assignment (DDA) represents a contrasting approach, assigning a whole GPU to a single VM. Both technologies are relevant within Azure Stack HCI environments, but they cater to different needs and have distinct operational requirements.

GPU partitioning | E2E Deployment and Operations Guide with Scalable Networking - 2

Feature	GPU Partitioning (GPU-P)	Discrete Device Assignment (DDA)
GPU Resource Model	High (one GPU to many VMs)	Low (one GPU to one VM)
Driver Requirement	GPU vendor driver (e.g., NVIDIA) in guest	GPU driver in guest
Compatibility	Supported by NVIDIA vGPU software	Broad hardware support

Crucially, a physical GPU cannot simultaneously function as both a DDA-assigned device and a partitionable GPU. This segregation ensures that the intended operational mode of the GPU is maintained. For users engaging with GPU partitioning, ensuring compatible CPUs, operating systems, and GPUs is paramount, particularly if live migration of VMs is a desired feature. The Azure Local catalog lists supported solutions and GPU models for those adopting these advanced configurations.

Licensing and Driver Nuances

The underlying technology for GPU partitioning, particularly NVIDIA's vGPU, is tied to a specific licensing model. Separate licensing from NVIDIA is a mandatory requirement for utilizing GPU-P features. The installation process for these drivers involves distinct steps for both the host machines and the individual virtual machines. Within Azure Stack HCI, the integration of NVIDIA's vGPU manager and graphics drivers is a critical step in enabling these capabilities. When managing these partitions, information such as the partition ID, VM ID, VRAM allocation, and encode/decode capabilities becomes visible.

Frequently Asked Questions

Q: How does GPU partitioning help businesses save money on server hardware?

GPU partitioning allows one physical GPU to be split among several virtual machines. This means companies do not need to buy a separate expensive GPU for every single virtual machine they run.

Q: What software is needed to set up GPU partitioning on Windows Server 2025?

You need the NVIDIA Virtual GPU Manager and specific graphics drivers from the NVIDIA licensing portal. Windows Admin Center is also recommended to help manage these configurations easily.

Q: What is the main difference between GPU partitioning and Discrete Device Assignment?

GPU partitioning (GPU-P) allows one GPU to be shared by many virtual machines. Discrete Device Assignment (DDA) gives one whole GPU to only one virtual machine.

Q: Can a single physical GPU be used for both partitioning and DDA at the same time?

No, a physical GPU cannot be used for both methods at once. You must choose one mode for the device before you start your setup.

GPU Partitioning for Windows Server 2025 and Azure Stack HCI Explained

Divergent Paths: GPU-P versus Discrete Device Assignment

Licensing and Driver Nuances

Frequently Asked Questions

NewsRadar

The Present

Search Records

Explore

GPU Partitioning for Windows Server 2025 and Azure Stack HCI Explained

Divergent Paths: GPU-P versus Discrete Device Assignment

Licensing and Driver Nuances

Frequently Asked Questions

Know What Changed

NVIDIA releases SANA-WM video model on May 17 2026 for faster creation

System Freezes Cause "Not Responding" Errors on PCs

YouTube AI Likeness Detection Tool Now Open to All Creators 18 Plus

NVIDIA cuObjClient API Specification Released for Faster GPU Data Access

Senate Judiciary Committee calls tech CEOs to testify in June 2026

Are SSDs Memory? Flash Storage Explained for Computer Users

NewsRadar

The Present

Search Records

Explore