Improved CUDA non-bonded kernel performance