We’re standing up some docker-based Nvidia GPU compute workloads for the rapids.ai ecosystem for replacing/accelerating Spark & friends. However, we’re lost in Nutanix GPU virtualization docs, so curious if folks have ideas on the pieces for nutanix to work here.
Right now, we’re thinking P100/V100 GPU → ahv / esxi → rhel 8.x → docker, and as an optional reach target, see if we can guest multiple host OS’s to share the same GPU(s). We’ve successfully done GPU → Ubuntu+RHEL → docker, but without ahv/esxi in the mix. Most ahv+esxi gpu articles seem more about VDI than compute, so we’re uncertain.
Experiences? Ideas? Tips?