BioD PNPI Git Repos - alexxy/gromacs.git/commit

author	Roland Schulz <roland.schulz@intel.com>
	Thu, 29 Mar 2018 01:55:52 +0000 (18:55 -0700)
committer	Roland Schulz <roland.schulz@intel.com>
	Tue, 15 May 2018 16:25:09 +0000 (09:25 -0700)
commit	a168d2e692fa1228998e222076de8f47956569b3
tree	c7b9ae92d89e6990138baa9d61470c04b1e3a649	tree \| snapshot
parent	2f839a8152d7c152f35e254bfce262b2bbc4c033	commit \| diff

Allow OCL CL_SIZE to be set to 4 for Intel

Add GMX_OCL_CLUSTER_SIZE which can be set to 4 for e.g. Intel.
The kernel should now work on any HW with at least
CL_SIZE*CL_SIZE/2 wide sub-groups (warp-sync execution).
This is 8(/32) for CL_SIZE 4(/8). Not tested for CL_SIZE other
than 4 or 8.

Fixes:
- make_fep_list_supersub was incorrect for CL_SIZE!=8.
- reduce_force_i_pow2 was incorrect for CL_SIZE<8 and 2 warps.
- i-atom preload, nbnxn_excl_t, warp-any init for CL_SIZE!=8.
- gpu_ref for CL_SIZE!=8.

Change-Id: I1114e408d28b9eb6306722c41fd6a6ccec52211b

cmake/gmxManageOpenCL.cmake		diff \| blob \| history
src/config.h.cmakein		diff \| blob \| history
src/gromacs/gpu_utils/ocl_compiler.cpp		diff \| blob \| history
src/gromacs/mdlib/nbnxn_kernels/nbnxn_kernel_gpu_ref.cpp		diff \| blob \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel.clh		diff \| blob \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel_pruneonly.clh		diff \| blob \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel_utils.clh		diff \| blob \| history
src/gromacs/mdlib/nbnxn_pairlist.h		diff \| blob \| history
src/gromacs/mdlib/nbnxn_search.cpp		diff \| blob \| history