Use GpuTimers in CUDA and OpenCL versions of NBNXM directly