tf::syclFlow provides a template method, tf::syclFlow::single_task, for creating a task to run the given callable using a single kernel thread.
Run a Task with a Single Thread
You can create a task to run a kernel function just once, i.e., using one GPU thread. This is handy when you want to set up a single or a few global variables that do not need multiple threads and will be used by multiple kernels afterwards. The following example creates a single-task kernel that sets gpu_variable
to 1.
sycl::queue queue;
int* gpu_variable = sycl::malloc_shared<int>(1, queue);
[gpu_variable] () { *gpu_variable = 1; }
);
sycl::range<1>(N), [=] (sycl::id<1> id) { data1[id] *= gpu_variable; }
);
}, queue);
class to create a task handle over a node in a taskflow graph
Definition task.hpp:187
class for building a SYCL task dependency graph
Definition syclflow.hpp:23
syclTask single_task(F &&func)
invokes a SYCL kernel function using only one thread
Definition syclflow.hpp:492
syclTask parallel_for(ArgsT &&... args)
creates a kernel task
Definition syclflow.hpp:500
handle to a node of the internal CUDA graph
Definition sycl_task.hpp:21
syclTask & precede(Ts &&... tasks)
adds precedence links from this to other tasks
Definition sycl_task.hpp:131