Intel® VTune™ Amplifier

graphics-rendering Command Line Analysis

Use the graphics-rendering value to launch the GPU Rendering analysis (preview) and estimate your code performance based on the GPU usage per engine and GPU hardware metrics.

It focuses on the following usage models:

How It Works

Note

This analysis type is available on the processors based on Intel® microarchitecture code name Broadwell and later.

The GPU In-kernel Profiling instruments your code and, depending on your configuration settings, helps identify performance-critical basic blocks or issues caused by memory accesses in the GPU kernels.

Since the GPU In-kernel Profiling incurs higher performance overhead than the GPU Compute/Media Hotspots analysis, you may consider first running the GPU Compute/Media Hotspots analysis to identify the hottest GPU computing task (GPU kernel) and then exploring this kernel with the GPU In-kernel Profiling.

GPU In-kernel profiling introduces the following key metrics:

Syntax

$ amplxe-cl [--target-system=ssh:username@hostname[:port]]--collect graphics-rendering [--knob <knobName=knobValue>] -- [target] [target_options]

Knobs: gpu-sampling-interval, gpu-counters-mode=render-basic.

Note

For the most current information on available knobs (configuration options) for the GPU Rendering, enter:

$ amplxe-cl -help collect graphics-rendering

Example

This example runs system-wide GPU Rendering analysis for a remote Xen target:

host>./amplxe-cl --target-system=ssh:user1@172.16.254.1 –-collect graphics-rendering --duration 0

This example profiles an OpenGL-ES app running the GPU Rendering analysis:

host>./amplxe-cl –-collect graphics-rendering --target-process process1

What's Next

When the data collection is complete, do one of the following to view the result:

See Also