I'm looking to transition away from our managed service provider and need recommendations for monitoring and alerting tools suited to our on-premise infrastructure. We're a small team managing servers, virtual machines, and some storage, a few hundred devices in total. We're not considering AWS solutions, as that side is handled by our DevOps team. We're open to both paid and open-source options, though cost-effective would be a plus. Our network team has been exploring OpenNMS, but I'm looking for suggestions focused more on system monitoring.
5 Answers
For a more advanced setup, integrate Telegraf with InfluxDB and Grafana. It’s unbeatable when it comes to custom monitoring options and strong alerting capabilities.
PRTG has been my go-to choice for monitoring. I’ve had a frustrating experience with Zabbix in the past due to its complicated setup, but others may find Zabbix works well for them.
I suggest giving Uptime Kuma a try. It’s great for basic checks like database or DNS responsiveness.
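To make "basic checks" concrete, here is a minimal Python sketch of what a DNS or database responsiveness probe boils down to: resolve a name, or open a TCP connection to a port, and time it. This is not how Uptime Kuma is implemented; the function names `check_tcp` and `check_dns` and the host names in the usage comment are hypothetical, and only the standard library is assumed.

```python
import socket
import time


def check_tcp(host, port, timeout=3.0):
    """Try a TCP connect to host:port; return (ok, elapsed_seconds)."""
    start = time.monotonic()
    try:
        # create_connection raises OSError on refusal, timeout, or DNS failure
        with socket.create_connection((host, port), timeout=timeout):
            return True, time.monotonic() - start
    except OSError:
        return False, time.monotonic() - start


def check_dns(name):
    """Try to resolve a name; return (ok, elapsed_seconds)."""
    start = time.monotonic()
    try:
        socket.getaddrinfo(name, None)
        return True, time.monotonic() - start
    except socket.gaierror:
        return False, time.monotonic() - start


# Example usage (hypothetical hosts, replace with your own):
#   ok, secs = check_tcp("db01.example.internal", 5432)
#   ok, secs = check_dns("intranet.example.internal")
```

A successful TCP connect only shows that the port accepts connections, not that the database answers queries, so a dedicated tool is still the better choice for anything beyond a quick sanity check.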
Zabbix is totally free and well-documented, making it easier to set up, especially if you combine it with a configuration management tool like Puppet or Ansible.
Check out Checkmk! It's effective, though it can be quite chatty right away, so be sure to enable the averaging feature when you start.
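The general idea behind averaging, sketched below in Python, is to alert on the mean of the last few samples rather than on each individual reading, so a single spike does not trigger a notification. This is an illustration of the concept only, not Checkmk's actual implementation or configuration; the class name `AveragedThreshold` and its parameters are hypothetical.

```python
from collections import deque


class AveragedThreshold:
    """Alert only when the mean of the last `window` samples exceeds `threshold`."""

    def __init__(self, threshold, window=5):
        self.threshold = threshold
        self.samples = deque(maxlen=window)  # oldest sample drops off automatically

    def add(self, value):
        """Record a sample; return True if the averaged value breaches the threshold."""
        self.samples.append(value)
        if len(self.samples) < self.samples.maxlen:
            return False  # not enough history yet, stay quiet
        return sum(self.samples) / len(self.samples) > self.threshold


# Example usage: CPU percentage samples with a 90% threshold over 3 readings
#   cpu = AveragedThreshold(90, window=3)
#   for sample in (50, 100, 60, 100, 100, 100):
#       alert = cpu.add(sample)
```

The trade-off is latency: a longer window suppresses more noise but also delays alerts for genuine sustained problems.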
