Intel® Advisor Help
Purpose and Usage | Report Regions
Run a Survey analysis to generate a Survey Report that offers integrated compiler report data and performance data for your target application all in one place. Optionally run a Trip Counts analysis and/or FLOP analysis to add data to the Survey Report. The Roofline analysis runs a Survey analysis followed by a FLOP analysis automatically.
Survey analysis - Identifies:
Where vectorization, or parallelization with threads, will pay off the most
If vectorized loops are providing benefit, and if not, why not
Un-vectorized loops and why they are not vectorized
Performance problems in general
Trip Counts analysis - Dynamically identifies the number of times loops and functions are invoked and executed (also called call count/loop count and iteration count respectively). Use Trip Counts data to:
Detect loops with too-small trip counts and trip counts that are not a multiple of vector length.
Analyze parallelism granularity more deeply.
FLOP analysis - Dynamically measures floating-point and integer operations, and memory traffic. Use the FLOP analysis to generate application memory usage and performance values that help you make better decisions about your vectorization strategy.
Roofline analysis - Helps you visualize actual performance against hardware-imposed performance ceilings, as well as determine the main limiting factor (memory bandwidth or compute capacity), thereby providing an ideal roadmap of potential optimization steps.
Use the Roofline chart to answer the following questions:
What is the maximum achievable performance with your current hardware resources?
Does your application work optimally on current hardware resources?
If not, what are the best candidates for optimization?
Is memory bandwidth or compute capacity limiting performance for each optimization candidate?
1 |
Filters pane - Filter analysis data by a variety of criteria, such as module, loop/function, and vectorized/non-vectorized. |
2 |
Roofline Chart pane - visualize actual performance against hardware-imposed performance ceilings, as well as determine the main limiting factor (memory bandwidth or compute capacity), thereby providing an ideal roadmap of potential optimization steps. |
3 |
Loop Information pane - View integrated compiler report data and Intel Advisor performance data for target application loops, and mark a loop for deeper analysis. |
4 |
Advanced View pane - View more information for a loop selected in the Loop Information pane. |
Use the Survey Source window to view details about a code region. Do one of the following to access the Survey Source window:
Double-click a row in the Loop Information pane or in the Top Down tab of the Advanced View pane.
Right-click a loop and select View Source.