Use subgroup shuffle for reduction