This is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.
|Session Name:||Optimizing DX12/DXR GPU Workloads using Nsight Graphics: GPU Trace and the Peak-Performance-Percentage (P3) Method (Presented by NVIDIA)|
|Track / Format:||Programming|
|Overview:||This talk shows how Nsight GPU Trace can be used to determine the performance limiters of any DX12 workload on NVIDIA Turing GPUs, and improve performance by applying architecture-aware optimizations. Because the tool captures all of its metrics in a single pass (no frame replay), it can be used on DX12 frames that use asynchronous compute or copy queues. After recaping what the Peak-Performance-Percentage Method is, the talk shows how it can be applied to unlock perf speedups on various workloads, including: compute shaders with large thread-group sizes, pixel shaders with out-of-order completion, ray-tracing BVH updates and ray-tracing denoisers.|