Multi-node CUDA does not work.
When trying to run the CUDA benchmark on multiple systems it fails. Looks like this is out-of-scope for the project though. Maybe making this work in the future is of interest, although there is already the option to use GPUs for the various other benchmarks using --use-gpus
.