Do NVIDIA device link for executables
This should not really be necessary for our native CUDA device, but the Kokkos library needs the device link step enabled because its library contains pre-compiled device code. Any executable that has its own device code must run this step. (I'm not sure why this does not happen by default.)
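
For illustration only, here is a minimal sketch of what a separate device link step looks like when driving nvcc by hand. The file name main.cu, the function names twice and fill, and the build commands in the comments are assumptions made for this example; they are not taken from this project's build system, and the host link line in particular will usually need a library path for the CUDA runtime.

```cuda
// main.cu (hypothetical file name): an executable with its own device code.
//
// Assumed manual build flow with relocatable device code:
//   nvcc -dc main.cu -o main.o            // compile, keeping device code relocatable
//   nvcc -dlink main.o -o main_dlink.o    // the device link step
//   g++ main.o main_dlink.o -lcudart -o main   // host link (add -L<CUDA lib dir> as needed)
//
// Objects or libraries that carry relocatable device code would also be
// passed to the -dlink step so their device symbols can be resolved.

#include <cstdio>
#include <cuda_runtime.h>

// Device function (illustrative); doubles its argument.
__device__ int twice(int x) { return 2 * x; }

// Kernel (illustrative); writes twice(i) into out[i] for i < n.
__global__ void fill(int *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = twice(i);
    }
}

int main() {
    const int n = 8;
    int host[n];
    int *dev = nullptr;

    if (cudaMalloc(&dev, n * sizeof(int)) != cudaSuccess) {
        std::fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    fill<<<1, n>>>(dev, n);

    // cudaMemcpy waits for the kernel and reports any earlier launch error.
    cudaError_t err = cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int i = 0; i < n; ++i) {
        std::printf("%d ", host[i]);
    }
    std::printf("\n");
    return 0;
}
```

When nvcc performs the final link itself, it normally runs the device link implicitly; the explicit -dlink step matters when a host compiler or a build system does the final link, which may be the situation this change addresses.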