cuda 12.7 release news

Cuda 12.7 Release | News

Historically, kernel attributes (like shared memory configuration or max dynamic shared memory) were often set via separate driver calls or boilerplate code. cudaLaunchKernelEx allows developers to pass an extended attribute structure directly at launch time.

CUDA 12.7 is a significant update that offers a range of new features, improvements, and enhancements for NVIDIA developers. With improved performance, better support for deep learning frameworks, and enhanced multi-GPU capabilities, this release is likely to be popular among developers building GPU-accelerated applications. If you're an NVIDIA developer, be sure to check out CUDA 12.7 and take advantage of its many features and improvements.

In the past, launching kernels from within other kernels (CDP) was a useful but heavy operation. With 12.7, NVIDIA has introduced a "lightweight" grid launch capability. This reduces the overhead of nested launches significantly.

CUDA 12.7 introduces a major upgrade for .