CUDA currently doesn't support building for compute_ and having compiled in virtuals ( using separable compilation ). So we need to transition everything over to sm_
compute_
sm_