Separate tile buffers on every device are only needed when denoising is active, so that overlapping tile regions rendered simultaneously do not write to the same memory region.
When denoising is not active and peer memory support is available, they can be distributed like all other memory.
I tested this on a multi-GPU system with denoising both on and off and found no problems, though that system does not have peer memory support.
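
The rule described above can be sketched roughly as follows. This is an illustration only: the enum, the function name, and both parameters are hypothetical and are not taken from the actual code. It also assumes that without peer memory support each device keeps its own buffer regardless of denoising.

```cpp
// Hypothetical names for illustration; not the actual renderer API.
enum class TileBufferAllocation {
  SeparatePerDevice,  // every device holds its own tile buffer
  DistributedViaPeer, // one allocation, shared across devices through peer memory
};

// Decide how tile buffers are allocated.
// Denoising renders overlapping regions at the same time, so it always needs
// separate buffers. Distribution is only possible with peer memory support
// (assumption: without it, buffers stay per device).
constexpr TileBufferAllocation choose_tile_buffer_allocation(bool denoising_active,
                                                             bool has_peer_memory)
{
  return (!denoising_active && has_peer_memory) ? TileBufferAllocation::DistributedViaPeer :
                                                  TileBufferAllocation::SeparatePerDevice;
}
```

With this sketch, the tested configuration (multi-GPU, no peer memory support) yields separate per-device buffers whether denoising is on or off; only the combination of no denoising and available peer memory switches to a distributed allocation.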