This update serves as a bookend to the Blackwell era. As the industry looks toward 2026, whispers of the Rubin architecture are already circulating. However, NVIDIA’s messaging is clear: CUDA 12.6 is the stable bedrock upon which the AI factories of 2026 will be built.
The December update to CUDA 12.6 lets developers define "soft" and "hard" constraints for memory migration. Instead of the driver blindly swapping pages out to slower system RAM, specific allocations can be tagged as "High Priority" (kept in VRAM at all costs) or "Streamable" (allowed to migrate transparently to CPU RAM or to NVMe via GPUDirect Storage).
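To make the distinction concrete, here is a toy Python model of how such a policy could behave when VRAM fills up. It is a sketch of the idea only, not the CUDA API: the names `Policy`, `Allocation`, and `plan_evictions` are hypothetical, and so are the mapping of "High Priority" to a hard constraint and "Streamable" to a soft one, and the largest-first eviction order.

```python
from dataclasses import dataclass
from enum import Enum


class Policy(Enum):
    # Assumed mapping: "High Priority" is the hard constraint,
    # "Streamable" is the soft one.
    HIGH_PRIORITY = "high_priority"  # must stay resident in VRAM
    STREAMABLE = "streamable"        # may be migrated out of VRAM


@dataclass
class Allocation:
    name: str
    size_mb: int
    policy: Policy


def plan_evictions(resident, needed_mb, capacity_mb):
    """Choose which streamable allocations to migrate so that a new
    request of `needed_mb` fits within `capacity_mb` of VRAM.

    Returns a list of allocation names to evict (possibly empty).
    Raises MemoryError if the request only fits by evicting a
    high-priority allocation, which the hard constraint forbids.
    """
    used_mb = sum(a.size_mb for a in resident)
    to_free_mb = used_mb + needed_mb - capacity_mb
    if to_free_mb <= 0:
        return []  # already fits, nothing has to move

    # Only streamable allocations are candidates. Largest first is an
    # arbitrary choice made here to keep the number of migrations low.
    candidates = sorted(
        (a for a in resident if a.policy is Policy.STREAMABLE),
        key=lambda a: a.size_mb,
        reverse=True,
    )

    evicted = []
    for alloc in candidates:
        if to_free_mb <= 0:
            break
        evicted.append(alloc.name)
        to_free_mb -= alloc.size_mb

    if to_free_mb > 0:
        raise MemoryError("request does not fit without evicting high-priority data")
    return evicted


if __name__ == "__main__":
    resident = [
        Allocation("weights", 6000, Policy.HIGH_PRIORITY),
        Allocation("kv_cache", 1500, Policy.STREAMABLE),
        Allocation("dataset", 2000, Policy.STREAMABLE),
    ]
    print(plan_evictions(resident, needed_mb=1500, capacity_mb=10000))
```

In this model the model weights never leave VRAM, while the streamable buffers absorb all of the memory pressure. A request that could only be satisfied by moving the weights fails outright instead of silently degrading performance.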