Why Does My ESXi VM Freeze After 30-60 Minutes with GPU Passthrough?

0
19
Asked By TechLover99 On

I'm working on setting up GPU passthrough with ESXi 8.0 U2, and I keep hitting a snag. My VM boots perfectly fine with the GPU assigned, but about 30 minutes to an hour later, it just freezes up completely. When this happens, the VM becomes unresponsive in the vSphere UI, and the only way to recover it is by powering it off. Sometimes, even after shutting it down, I can't get the VM to power back on without rebooting the whole host. Here's a bit about my setup and the troubleshooting I've attempted: I'm using Asus 870e Rog hardware, and I'm testing with NVIDIA A2 and A16 GPUs, all passed through via PCI passthrough. I'm running ESXi 8.0.0 U2. I've made a few tweaks to the VM configuration like disabling svga, playing with the hypervisor flags, and toggling the hot add settings. I've also observed that nvidia-smi doesn't show any info on the host, which I expected since I'm using passthrough. The VM freezes when it's idle or after some usage, but not right after booting. Additionally, I've found logs about the TPM 2.0 device not having the TIS interface active and some NVRM entries. So, I'm really puzzled about what might be causing this freeze after running fine for half an hour or so, and why I need to reboot the host to get it back online.

2 Answers

Answered By GadgetGuru42 On

Have you checked if all your hardware, drivers, and software match the compatibility chart from NVIDIA? If you're not on that list, your setup might not be supported, which could lead to issues like this. It’s definitely something to consider as a first step.

Answered By SysAdminSammy On

Just a heads up, this might not be the right place for your question. You might have better luck posting in communities like r/techsupport, r/vmware, or r/homelab, where folks deal with these kinds of setups daily.

Related Questions

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.