Taskflow 3.2.0-Master-Branch
tf::cudaFlow provides a template method, tf::cudaFlow::single_task, for creating a task to run the given callable using a single kernel thread.
You need to include the header file, taskflow/cuda/algorithm/for_each.hpp, to create a single-threaded task.
You can create a task that runs a kernel function just once, i.e., using one GPU thread. This is handy when you want to set up one or a few global variables that do not need multiple threads and will be used by multiple kernels afterwards. A typical example is a single-task kernel that sets gpu_variable to 1.
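A minimal sketch of such a single-task kernel is given below. It assumes the Taskflow 3.x interface in which a cudaFlow task is created with taskflow.emplace and offloaded by the executor when the callable returns. The helper name run_single_task, the host-side readback, and the include of taskflow/cuda/cudaflow.hpp are assumptions made for illustration; only for_each.hpp is named in the text above. Error checking on the CUDA calls is omitted for brevity.

```cpp
#include <cuda_runtime.h>
#include <taskflow/taskflow.hpp>
#include <taskflow/cuda/cudaflow.hpp>            // assumed header path for tf::cudaFlow
#include <taskflow/cuda/algorithm/for_each.hpp>  // header named above for single_task

// Hypothetical helper: runs one single-task kernel and returns the value it wrote.
int run_single_task() {
  // Allocate one int on the device; this plays the role of gpu_variable.
  int* gpu_variable = nullptr;
  cudaMalloc(&gpu_variable, sizeof(int));

  tf::Taskflow taskflow;
  tf::Executor executor;

  // Create a cudaFlow task containing a single-threaded kernel.
  taskflow.emplace([&](tf::cudaFlow& cf) {
    // The callable runs on the GPU with exactly one thread.
    cf.single_task([gpu_variable] __device__ () {
      *gpu_variable = 1;
    });
  });

  // Run the taskflow and wait until the kernel has finished.
  executor.run(taskflow).wait();

  // Copy the result back to the host to inspect it.
  int host_value = 0;
  cudaMemcpy(&host_value, gpu_variable, sizeof(int), cudaMemcpyDeviceToHost);
  cudaFree(gpu_variable);
  return host_value;
}
```

Because the callable is a device lambda, the file has to be compiled with nvcc and extended lambda support enabled (for example, the --extended-lambda flag).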
Since the callable runs on the GPU, it must be declared with a __device__ specifier.
The single-task algorithm is also available in tf::cudaFlowCapturer::single_task.
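The capturer variant can be sketched in the same way. The example below assumes that a tf::cudaFlowCapturer task is created through taskflow.emplace just like a cudaFlow task, and that including taskflow/cuda/cudaflow.hpp also makes the capturer available; the helper name run_single_task_captured is hypothetical.

```cpp
#include <cuda_runtime.h>
#include <taskflow/taskflow.hpp>
#include <taskflow/cuda/cudaflow.hpp>            // assumed to provide tf::cudaFlowCapturer
#include <taskflow/cuda/algorithm/for_each.hpp>  // header named above for single_task

// Hypothetical helper: same kernel as before, recorded through a capturer.
int run_single_task_captured() {
  int* gpu_variable = nullptr;
  cudaMalloc(&gpu_variable, sizeof(int));

  tf::Taskflow taskflow;
  tf::Executor executor;

  // The capturer records the single-task kernel into a CUDA graph
  // via stream capture instead of building the graph explicitly.
  taskflow.emplace([&](tf::cudaFlowCapturer& capturer) {
    capturer.single_task([gpu_variable] __device__ () {
      *gpu_variable = 1;
    });
  });

  executor.run(taskflow).wait();

  // Read the value back on the host.
  int host_value = 0;
  cudaMemcpy(&host_value, gpu_variable, sizeof(int), cudaMemcpyDeviceToHost);
  cudaFree(gpu_variable);
  return host_value;
}
```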