CUDA: Pass toolkit path to Clang
Clang isn't very good at finding the installed CUDA toolkit. The upstream recommendation is that we should pass the toolkit explicitly.
Avoids Clang having to search for the toolkit on every invocation. Allows the user to use a toolkit from a non-standard location by simply setting CUDAToolkit_ROOT.
Implements: #20754 (closed)