Intel® VTune™ Amplifier 201
For help on choosing a proper analysis type for your application, click the
Find Your Analysis link on the Welcome page.
Intel® VTune™ Amplifier introduces the following basic categories of performance analysis:
Predefined analysis types:
Each category represents a branch in the performance analysis tree. You may choose to run any of the available analysis types either from the product graphical interface or using command line interface (amplxe-cl).
To choose and configure an analysis type from GUI:
Open the Analysis Type window using any of the following options:
Click the
New Analysis toolbar button.
Click the
Configure Project toolbar button/select the
Configure Project... option from the product menu and click the
Analysis Type tab.
All analysis types in the VTune Amplifier are based on one of the following data collection types:
Hardware event-based sampling collection (driver-based or driverless mode), optionally extended with the stack collection
Each analysis type provides a set of performance metrics that helps you sort out the problems in your code and understand how to optimize it.
VTune Amplifier supports local and remote collection modes via GUI and command line on remote Android* and Linux* platforms.
The Algorithm Analysis collection introduces analysis types targeted for software tuning. You run the analysis and use the collected data to understand the inefficiencies in your current algorithms, and improve application performance.
Algorithm analysis includes the following analysis types:
Basic Hotspots focuses on a particular target, identifies functions that took the most CPU time to execute, restores the call tree for each function, and shows thread activity.
Advanced Hotspots monitors all the software executing on your system including the operating system modules. The collector interrupts the processor at the specified sampling interval and collects samples of instruction addresses.
Concurrency focuses on a particular target, identifies functions that took the most CPU time to execute, and shows how well your application is threaded for the existing number of logical CPU cores. In Intel System Studio, this analysis type does not support remote data collection on Android systems.
Locks and Waits helps identify the synchronization objects that might cause ineffective CPU usage. In Intel System Studio, this analysis type does not support remote data collection on Android systems.
Memory Consumption analyzes your Linux* native or Python* targets to explore memory consumption (RAM) over time and identify memory objects allocated and released during the analysis run.
The Compute-Intensive Application Analysis branch introduces analysis types based on applications that are compute-sensitive. They can be used as a starting point for overall application performance analysis before moving on to more targeted analysis types.
Compute-intensive application analysis includes the following analysis types:
HPC Performance Characterization evaluates compute-sensitive or throughput applications for floating point operation and memory efficiency. It can be used as a starting point for understanding overall application performance.
The Microarchitecture Analysis collection introduces analysis types that help you estimate how effectively you code runs on modern hardware.
General Exploration helps identify the most significant hardware issues affecting the performance of your application. Consider this analysis type as a starting point when you do hardware-level analysis.
Memory Access measures a set of metrics to identify memory access related issues (for example, specific to NUMA architectures).
SGX Hotspots helps identify performance-critical program units inside security enclaves on systems with Intel Software Guard Extensions (Intel SGX) feature enabled.
TSX Exploration is targeted for Intel processors supporting Intel Transactional Synchronization Extensions (Intel TSX). It helps analyze Intel TSX usage and causes of transactional aborts.
TSX Hotspots is targeted for Intel processors supporting Intel TSX. It helps analyze hotspots inside transactions.
The Platform Analysis branch introduces analysis types monitoring CPU, GPU and power usage for your application/system.
CPU/GPU Concurrency enables you to explore code execution on the various CPU and GPU cores on your platform, correlate CPU and GPU activity and identify whether your application is GPU or CPU bound.
GPU Hotspots is targeted for applications using a Graphics Processing Unit (GPU) for rendering, video processing, and computations. If you ran the CPU/GPU Concurrency analysis and identified that your application is GPU-bound, use the GPU Hotspots analysis to go deeper and identify the most time-consuming GPU computing tasks and analyze performance per GPU hardware metrics.
GPU In-Kernel Profiling is targeted for GPU-bound applications and helps analyze GPU kernel execution per code line and identify performance issues caused by memory latency or inefficient kernel algorithms. This analysis type incurs a higher performance overhead.
System Overview is a driverless event-based sampling analysis that monitors general behavior of your target Linux* or Android* system and correlates power and performance metrics with the interrupt request (IRQ) handling.
Disk Input and Output (preview on Windows OS) analysis monitors utilization of the disk subsystem, CPU and processor buses.
CPU/FPGA Interaction (preview) analysis explores FPGA utilization for each FPGA accelerator and identifies the most time-consuming FPGA computing tasks.
Graphics Rendering (preview) analysis is targeted to estimate the CPU/GPU utilization of your code running on the Xen virtualization platform.
A PREVIEW FEATURE may or may not appear in a future production release. It is available for your use in the hopes that you will provide feedback on its usefulness and help determine its future. Data collected with a preview feature is not guaranteed to be backward compatible with future releases. Please send your feedback to parallel.studio.support@intel.com.
All analysis types provided by the VTune Amplifier in the Analysis Type window are predefined by Intel architects and contain a default set of options. You may create custom analysis types based on the existing predefined analysis configurations.