CUDA: CMAKE_INTERPROCEDURAL_OPTIMIZATION
Starting with CUDA 11.2, nvcc supports device link-time optimization (LTO): https://developer.nvidia.com/blog/improving-gpu-app-performance-with-cuda-11-2-device-lto/
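I think CMake should honor the CMAKE_INTERPROCEDURAL_OPTIMIZATION variable
(and the INTERPROCEDURAL_OPTIMIZATION target property it initializes) for CUDA
targets by adding the corresponding device LTO flags to the compile and link lines.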
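
A minimal sketch of what this could look like from the user's side. It shows the requested behavior, not something CMake is confirmed to do here. The project name `lto_demo`, the target `kernels`, and the source file `kernels.cu` are hypothetical. The assumption is that enabling IPO on a CUDA target would make CMake pass nvcc's `-dlto` flag at both the compile and the device-link step.

```cmake
cmake_minimum_required(VERSION 3.18)
project(lto_demo LANGUAGES CXX CUDA)  # hypothetical project name

# Hypothetical CUDA library; kernels.cu is a placeholder source file.
add_library(kernels STATIC kernels.cu)

set_target_properties(kernels PROPERTIES
  # Device LTO operates on relocatable device code, so separable
  # compilation is enabled here.
  CUDA_SEPARABLE_COMPILATION ON
  # Requested behavior (assumption): for CUDA sources this would
  # translate to nvcc's -dlto at compile and device-link time.
  INTERPROCEDURAL_OPTIMIZATION ON)
```

Setting `CMAKE_INTERPROCEDURAL_OPTIMIZATION` to `ON` before the target is created would initialize the same property for every target, which is why supporting the variable and the property together seems natural.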
Edited by Robert Maynard