Profiling OpenCL™ Applications
Profiling an OpenCL™ application with Intel® VTune™ Amplifier XE 2018 is similar to profiling any native or offload application on Intel® Xeon Phi™ coprocessors.
To profile your OpenCL™ application, do the following:
- Install the sampling driver.
- Create a VTune Amplifier XE 2018 project.
- Run an Advanced Hotspots collection.
For more information about these steps, refer to the “Intel® Xeon Phi™ Processor Targets” section of the Intel VTune Amplifier 2018 User’s Guide.
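The collection step can also be scripted. The following is a minimal sketch, not the documented workflow: it assumes the `amplxe-cl` command-line tool shipped with VTune Amplifier XE 2018 is on the `PATH` and that the sampling driver is already installed. The application path `./my_opencl_app` and the result directory name `r000ah` are hypothetical placeholders. The helper only builds and launches an `advanced-hotspots` collection; check the User's Guide for the options that apply to your target.

```python
import shutil
import subprocess


def build_collect_command(app, app_args=(), result_dir="r000ah"):
    """Build an amplxe-cl command line for an Advanced Hotspots collection.

    `app` and `result_dir` are illustrative placeholders, not names
    taken from the VTune Amplifier documentation.
    """
    return [
        "amplxe-cl",
        "-collect", "advanced-hotspots",  # analysis type to run
        "-result-dir", result_dir,        # where the result is stored
        "--",                             # everything after this is the target
        app,
        *app_args,
    ]


def run_collection(app, app_args=(), result_dir="r000ah"):
    """Run the collection, or print the command if amplxe-cl is unavailable."""
    cmd = build_collect_command(app, app_args, result_dir)
    if shutil.which(cmd[0]) is None:
        # Assumption: the tool is expected on PATH; otherwise just show the command.
        print("amplxe-cl not found on PATH; would run:", " ".join(cmd))
        return None
    return subprocess.run(cmd, check=False).returncode
```

For example, `run_collection("./my_opencl_app", ["--size", "1024"])` launches the hypothetical application under the collector and returns its exit code, or prints the command line when the tool is not installed.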
See Also
Threading: Achieving Parallelism Between Work-Groups
Utilizing Software Prefetching
Efficient Data Layout
Use Lower Math Precision
Use Branching Accurately
Developer Guide for Intel® SDK for OpenCL™ Applications
Optimization and Performance Tuning for Intel® Xeon Phi™ Coprocessors, Part 2
Intel® Xeon Phi™ Processor Targets