Intel® VTune™ Amplifier
Use a platform-wide System Overview analysis to monitor a general behavior of your target Linux* or Android* system and correlate power and performance metrics with the interrupt request (IRQ) handling.
This analysis type uses the Perf*-based driverless sampling collection that is based on NMI and enables analyzing IRQ handlers code even if IRQs are masked.
For Linux targets, the System Overview analysis collects the following Ftrace* events: sched, freq, idle, workq, irq, softirq.
For Android targets, the System Overview analysis collects the following events:
Atrace* events: input, view, webview, audio, video, camera, hal, res, dalvik
Ftrace events: sched, freq, idle, workq, filesystem, irq, softirq, sync, disk
To use the System Overview analysis, explore:
Configuration options (knobs)
Viewpoints
To configure options for the System Overview analysis:
Prerequisites: Create a project and specify an analysis target.
Click the
New Analysis button on the
Intel® VTune™ Amplifier toolbar.
The New Amplifier Result tab opens with the Analysis Type window active.
From the analysis tree on the left pane, select Platform Analysis > System Overview
The analysis configuration pane opens on the right.
Configure the following options:
CPU sampling interval, ms field |
Specify an interval (in milliseconds) between CPU samples. Possible values - 0.01-1000. The default value is 0.10000000000000001. |
Collection Level options |
Select a level of details provided with event-based sampling collection. Detailed collection levels cause higher overhead.
The default value is Hotspots. |
Details button |
Expand/collapse a section listing the default non-editable settings used for this analysis type. If you want to modify or enable additional settings for the analysis, you need to create a custom configuration by copying an existing predefined configuration. VTune Amplifier creates an editable copy of this analysis type configuration and locates it under the Custom Analysis section on the left pane. |
You may generate the command line for this configuration using the Command Line... button at the bottom.
Click Start to run the analysis.
VTune Amplifier collects the data, generates a rxxxso result, and opens it in the default Hotspots viewpoint. For System Overview analysis, start with the Top Interrupt Handlers and Interrupt Duration Histogram sections of the Summary window. Identify slow interrupt handlers and switch to the grid view grouped by Interrupt/Interrupt Duration Type/Function/Call Stack for further interrupts analysis.
You can also choose to view System Overview analysis results from different perspectives using the following viewpoints:
Viewpoint |
Description |
---|---|
Hardware Issues |
Helps identify where the application is not making the best use of available hardware resources. This viewpoint displays metrics derived from hardware performance counters. Hover over the highlighted metrics values in the grid to read why the extreme value might represent a performance problem. |
Hardware Events |
Displays statistics of monitored hardware events: estimated count and/or the number of samples collected. Use this view to identify code regions (modules, functions, code lines, and so on) with the highest activity for an event of interest. |
Platform Power Analysis |
Helps identify where the application is generating idle and wake-up behavior that can lead to inefficient use of energy. Where possible, it provides data from both the OS and hardware perspective, such as the detailed C-state residency report that shows the OS requested time in deep sleep states compared to the actual residency the hardware indicated. |
HPC Performance Characterization |
Helps understand how effectively your application uses CPU, memory, and floating-point operation resources. Use this view to identify scalability issues for Intel OpenMP and MPI runtimes as well as next steps to increase memory and FPU efficiency. |
These viewpoints may include the following windows:
Summary window displays statistics on the overall application execution.
Event Count window displays the event count for all processor events selected for the analysis. This view provides an estimated number of times an event occurred during the collection.
Sample Count window displays the sample count for all collected processor events. This view provides the actual number of samples collected for an event.
Uncore Event Count window displays counts of uncore events selected for the analysis. If there are no uncore events, the upper pane of the window is empty.
Caller/Callee window displays parent and child functions of the selected focus function. This window is available only if stack collection was enabled during analysis configuration.
Top-down Tree window displays hotspot functions in the call tree, performance metrics for a function only (Self value) and for a function and its children together (Total value).
Platform window provides details on CPU and GPU utilization, frame rate, memory bandwidth, and user tasks (if corresponding metrics are collected).
Graphics window displays overtime data as GPU usage on a software queue, CPU time usage, OpenCL™ kernels data, and GPU performance per the Overview group of GPU hardware metrics, Memory Bandwidth, and CPU Frequency.