Application Performance Snapshot User's Guide for Linux* OS
Use Application Performance Snapshot for a quick view into a shared memory or MPI application's use of available hardware (CPU, FPU, and memory). Application Performance Snapshot analyzes your application's CPU and FPU usage, I/O and memory footprint, memory access stalls, and MPI and OpenMP* utilization. After analysis, it displays basic performance enhancement opportunities for systems using Intel® platforms. Use this tool as a first step in application performance analysis to get a simple snapshot of key optimization areas.
Application Performance Snapshot is available as a free product download from the Intel® Developer Zone at https://software.intel.com/performance-snapshot and is also available pre-installed as part of Intel® Parallel Studio or Intel® VTune™ Amplifier.
Starting from the 2018 Beta release, the updated Application Performance Snapshot for Linux* OS includes most of the functionality previously available in the MPI Performance Snapshot. MPI Performance Snapshot is no longer available as a separate tool.
This User's Guide documents Application Performance Snapshot 2018 Update 3 and 2019 Beta Update for Linux* OS.
Following is a change log for the current and previous product releases:
The aps-report utility added the --format option, which allows the report to be generated in either text (*.txt) or comma-separated (*.csv) format. The CSV format can be useful for report processing automation or export to spreadsheet programs such as Microsoft Excel*.
The Rank-to-Rank data transfers report was enriched with an aggregated communication time column.
MPI trace file size was compacted with compression and minimal statistic level set by default. Some reports generated by the aps-report utility will be inapplicable with minimal statistic level. See Controlling Amount of Collected Data for more information.
Report generation time with the aps-report utility was significantly improved.
Added the Communicators List chart.
Added the -C, -M options for the Collective Operations Time per Rank chart.
Added data collection control. See Controlling Amount of Collected Data.
Added integration with Intel® Trace Analyzer and Collector. See Creating Configuration File for Intel® Trace Collector.
Removed restrictions for MPI_Pcontrol region numbers.
The tool is now invoked as aps rather than aps.sh.
Result directory change from stat_* to aps_result_*.