Add missing cycle counting for GPU halo exchange calls