Properly use CUDA signbit functions.

1022 jobs for cuda_signbit_guards