CUDA based worklets should now not timeout.
The first CUDA worklet test requires way more time because of the overhead to allow the driver to convert the kernel code from virtual arch to actual arch.
The first CUDA worklet test requires way more time because of the overhead to allow the driver to convert the kernel code from virtual arch to actual arch.