I'm facing a challenge while reviewing the backup testing procedures for restoring virtual machines. When we restore VMs, is it sufficient to just check that they can boot up and that I can log in, or should I be validating specific services as well? For instance, if I restore a file server, should I check some files? If so, how many files should be tested? The same goes for SQL servers—does booting the VM suffice, or should I also execute some query tests? For context, our setup uses Veeam for backups, and we have over 300 VMs. Would I need to individually test all of these?
5 Answers
It really varies, but user access testing is crucial. Start with basic checks like:
1. Ensure the VM powers on successfully.
2. Verify the OS boots without issues.
3. Check that authentication works.
4. Look for any disk or filesystem corruption.
For more specific applications (L2 checks), you want to test access to files, so mix it up with a few small and large files across different directories. For SQL servers, make sure the SQL service starts up, check that the databases are online, and run a simple query like 'SELECT COUNT(*) FROM ' to confirm functionality. It's all about User Acceptance Testing (UAT) being done right!
If you’re using Veeam, check out SureBackup. It can automatically validate recovery processes, ensuring your SQL databases are up and running. If you have a Premium license, take advantage of Veeam Recovery Orchestrator. It automates recovery verification and lets you write scripts to verify expected values in your ERP system—all without manually logging into each VM.
What I do is restore our VMs in an isolated environment. Then I connect them, log in, and run some scripts to test their functionality. This method keeps everything contained and ensures we can validate without affecting the main network.
It honestly depends on the server type. Some backup applications let you automate both the restore and the testing processes, which can save a lot of time and effort.
For ransomware recovery, one of the best methods is conducting an 'isolated clean room recovery.' This way, you can ensure safe recovery while testing. If you don't have such a setup, it might be worthwhile to research and implement one, especially because many audits won't pass user-requested recovery tests without it.

Related Questions
Can't Load PhpMyadmin On After Server Update
Redirect www to non-www in Apache Conf
How To Check If Your SSL Cert Is SHA 1
Windows TrackPad Gestures