Allow x D2H to overlap with GPU force compute