PME reduction for CUDA F buffer operations