Implement generic j-reduction in the nbnxm SYCL kernels