Session Name: | Optimizing DX12/DXR GPU Workloads using Nsight Graphics: GPU Trace and the Peak-Performance-Percentage (P3) Method (Presented by NVIDIA) |
Speaker(s): | Louis Bavoil |
Company Name(s): | NVIDIA |
Track / Format: | Programming |
Overview: | This talk shows how Nsight GPU Trace can be used to determine the performance limiters of any DX12 workload on NVIDIA Turing GPUs, and improve performance by applying architecture-aware optimizations. Because the tool captures all of its metrics in a single pass (no frame replay), it can be used on DX12 frames that use asynchronous compute or copy queues. After recaping what the Peak-Performance-Percentage Method is, the talk shows how it can be applied to unlock perf speedups on various workloads, including: compute shaders with large thread-group sizes, pixel shaders with out-of-order completion, ray-tracing BVH updates and ray-tracing denoisers. |