Coding for the IntelŪ Architecture Processors
Introduction for OpenCL Coding on IntelŪ Architecture Processors
Vectorization Basics for IntelŪ Architecture Processors
Vectorization: SIMD Processing Within a Work-group
Benefitting from Implicit Vectorization
Vectorizer Knobs
Targeting a Different CPU Architecture
Using Vector Data Types
Writing Kernels to Directly Target the IntelŪ Architecture Processors
Work-Group Size Considerations
Threading: Achieving Work-Group Level Parallelism
Efficient Data Layout
Using the Blocking Technique
IntelŪ Turbo Boost Technology Support
Global Memory Size