Add wallcycle counting to StatePropagatorDataGpu