Taskflow 3.2.0-Master-Branch
Class to define execution policy for CUDA standard algorithms.
#include <cuda_execution_policy.hpp>
| Template parameter | Description |
|---|---|
| NT | number of threads per block |
| VT | number of work units per thread |
An execution policy configures the kernel execution parameters used by CUDA algorithms. The first template argument, NT, is the number of threads per block and should always be a power of two. The second template argument, VT, is the number of work units per thread; an odd number is recommended to avoid shared-memory bank conflicts.
See Execution Policy for details.
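The two constraints on NT and VT can be illustrated with a small host-only sketch. This is not the Taskflow implementation: `PolicySketch`, `SketchA`, `is_pow2`, and `distinct_banks` are hypothetical names introduced here, and the bank model assumes 32 shared-memory banks with each thread in a 32-thread warp starting its work at element `t * VT`.

```cpp
#include <cassert>

// Hypothetical helper: true when x is a power of two.
constexpr bool is_pow2(unsigned x) { return x != 0 && (x & (x - 1)) == 0; }

// Hypothetical stand-in for an execution policy; it only mirrors the
// roles of the two template arguments described above.
template <unsigned NT, unsigned VT>
struct PolicySketch {
  static_assert(is_pow2(NT), "NT (threads per block) must be a power of two");

  static constexpr unsigned nt = NT;       // threads per block
  static constexpr unsigned vt = VT;       // work units per thread
  static constexpr unsigned nv = NT * VT;  // work units per block

  // Blocks needed to cover n work units, rounding up.
  static constexpr unsigned num_blocks(unsigned n) {
    return (n + nv - 1) / nv;
  }
};

// Example configuration chosen for illustration only.
using SketchA = PolicySketch<128, 7>;

// Counts how many distinct banks the 32 threads of a warp touch when
// thread t reads element t * vt (assumed model: 32 banks, one element
// per bank slot). Fewer than 32 means some threads share a bank.
inline unsigned distinct_banks(unsigned vt) {
  assert(vt > 0 && "vt must be positive");
  bool hit[32] = {};
  unsigned count = 0;
  for (unsigned t = 0; t < 32; ++t) {
    unsigned bank = (t * vt) % 32;
    if (!hit[bank]) {
      hit[bank] = true;
      ++count;
    }
  }
  return count;
}
```

Under this model an odd VT shares no factor with 32, so the 32 threads land on 32 different banks, while an even VT such as 8 folds them onto only 4 banks. With `SketchA`, each block covers 128 * 7 = 896 work units, so 1000 work units need 2 blocks.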