Performance Debugging with Intel® SDK for OpenCL™ Applications
Performance Debugging Introduction
Host-Side Timing
Profiling Operations Using OpenCL™ Profiling Events
Comparing OpenCL™ and Native Code Performance
Getting Credible Performance Numbers
Tools for OpenCL™ Development