- Declare kernel globals and associated macros in one place
- Add IntegratorStateGPU for CPU host to access
- Reduce code duplication between CUDA and OptiX
- Make shader sort key part of integrator state template
- Remove unused CPU kernel globals
- Prepare for copying/moving states
- Rename integrator_path_state.h to integrator_state_flow.h to avoid confusion with kernel_path_state.h
This includes some changes from D11172 and will cause merge conflicts.
I can hold off on merging if that helps.