Maybe I should get some NVIDIA hardware to test this.
I wonder if Intel oneAPI could be supported too in a similar way...? That would offer cross-platform compatibility for Windows, Linux (and gradually MacOS) for hardware from NVIDIA, AMD and Intel. Therefore, oneAPI looks like a good fit for u++, but of course I don't know the internals.