Fix CUDA inter-stream synchronization issue