Asynchronous memory allocation for GPUs
Compare changes
- Dave Pugmire authored
@@ -16,10 +16,6 @@
@@ -552,7 +548,7 @@ struct VerifyFill
@@ -565,20 +561,6 @@ VTKM_CONT void RunTests()
Syncronous memory allocation can have a lot of overhead on GPUs. This was noticed when doing tests for multi-block filters. This is supported in cuda versions > 11.3
This is a redo of !2759 (closed). Much has changed since that MR so this will be the replacement.