CUDA: Generic all-major should compile only real architectures, plus virtual for the latest
NVCC's all-major compiles for all supported major real architectures and for the latest virtual architecture only. Our current implementation compiles for both the real and the virtual variant of every major architecture.
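To make the intended behavior concrete, here is a minimal Python sketch of the flag set that would match NVCC's all-major as described above. The function name `all_major_gencode_flags` and the architecture list `[50, 60, 70, 80]` are hypothetical and chosen only for illustration. The real list depends on the toolkit version, and this is not the project's actual implementation.

```python
def all_major_gencode_flags(majors):
    """Build nvcc --generate-code flags for a list of major architectures.

    Every architecture gets real (SASS) code; only the highest one
    additionally gets virtual (PTX) code.
    """
    archs = sorted(set(majors))
    # Real code for every major architecture.
    flags = [f"--generate-code=arch=compute_{a},code=sm_{a}" for a in archs]
    if archs:
        # Virtual code for the latest architecture only.
        latest = archs[-1]
        flags.append(
            f"--generate-code=arch=compute_{latest},code=compute_{latest}"
        )
    return flags


if __name__ == "__main__":
    # Hypothetical list of supported major architectures.
    for flag in all_major_gencode_flags([50, 60, 70, 80]):
        print(flag)
```

With four architectures this produces five flags: four real targets and one virtual target. The behavior described as current would instead produce a real and a virtual target for each of the four.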