Fix cuda-opengl interop

1. Use cudaPerThreadStream instead of the default streams
2. Since there have been changes to ArrayHandle code, the API to create
ArrayHandle from a device pointer has changed.
5 jobs for fix-cuda-interop in 0 seconds
latest