Use cudaLaunchKernel with CUDA 7.0 and later
authorSzilard Pall <pall.szilard@gmail.com>
Fri, 20 Nov 2015 17:09:33 +0000 (18:09 +0100)
committerSzilárd Páll <pall.szilard@gmail.com>
Sat, 6 Feb 2016 01:04:31 +0000 (02:04 +0100)
commit633ae13b37c76a7bcae9cd20cd3d4e35a93f573f
tree01f6fdbcbfe0e18e2ac88e7387f41aecc0b971c7
parent971335dcc39f11cc2e989bdfaf42af446de47a2d
Use cudaLaunchKernel with CUDA 7.0 and later

CUDA 7.0 introduced the cudaLaunchKernel API call similar to the
CUDA driver API and avoids the chevron notation. This has the benefit
of a slight reduction in runtime API overhead (up to 2%) partly
because two runtime API calls that precede the kernel launch are skipped
(cudaSetupArgument and cudaConfigureCall).

For future dev-testing the GMX_DISABLE_CUDALAUNCH env. var. can be used to
force the chevron-notation kernel launch.

Change-Id: Id057fb01489814b99ae290de9e4ddd9f530a04be
docs/user-guide/environment-variables.rst
src/gromacs/mdlib/nbnxn_cuda/nbnxn_cuda.cu