Is your feature request related to a problem? Please describe.
Currently, every bulk function adds its own early exit to avoid launching a kernel with 0 threads. Repeating this check by hand is laborious and easy to forget, so it should be handled more systematically.
Describe the solution you'd like
As mentioned in #151 (review), we would like a kernel launch wrapper that automatically skips the launch when the number of threads is 0.
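A minimal sketch of what such a wrapper could look like, written as host-only C++ so it compiles without nvcc. The names `grid_size_for` and `launch_if_nonempty` are hypothetical and not part of the existing code base. The `launch` callable is an assumption standing in for the actual launch, which in CUDA code would typically wrap something like `kernel<<<grid_size, block_size, 0, stream>>>(args...)`.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>

// Hypothetical helper: number of blocks needed to cover `num_threads` threads.
constexpr std::size_t grid_size_for(std::size_t num_threads, std::size_t block_size)
{
  return (num_threads + block_size - 1) / block_size;
}

// Hypothetical wrapper: invokes `launch(grid_size, block_size)` only when there
// is work to do. Returns true if the callable was invoked, false if skipped.
template <typename Launch>
bool launch_if_nonempty(std::size_t num_threads, std::size_t block_size, Launch&& launch)
{
  assert(block_size > 0);
  if (num_threads == 0) { return false; }  // nothing to do, so skip the launch
  std::forward<Launch>(launch)(grid_size_for(num_threads, block_size), block_size);
  return true;
}
```

With a helper along these lines, a bulk function would pass its input size and a small lambda that performs the launch, and the zero-size check would live in one place instead of being repeated in each function.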