Allow disabling cj prefetch in the CUDA nbnxm kernels