I'm working on setting up monitoring for our infrastructure, which consists of four EKS clusters housing about 15-20 pods each. I'm at a crossroads deciding whether to go with Amazon's native CloudWatch for my dashboards, alerts, and metrics or to use a stack with Prometheus and Grafana. I'm curious about a few things:
- What are the downsides of just sticking with CloudWatch? Is it going to be significantly pricier compared to Prometheus and Grafana?
- Is there anyone here who relies on CloudWatch as their primary monitoring for EKS? What has your experience been like?
I'm aware CloudWatch may come with higher costs, but I'm also considering the time savings that would come from not having to manage an open-source stack. Any thoughts or recommendations are welcome!
4 Answers
From what I've seen, CloudWatch Logs can get expensive. Prometheus and Grafana are widely regarded as the stronger tools for Kubernetes monitoring, with richer ecosystems. It's also worth getting familiar with them in the long run, since many companies use them. That said, if you need metrics from AWS services themselves, CloudWatch might still be necessary as the starting point.
CloudWatch can get pricey really quickly, especially with custom metrics. Each custom metric costs around 30 cents per month, and that multiplies depending on how you're using it, since every unique combination of metric name and dimensions is billed as its own metric. That can lead to thousands of dollars in expenses if you're not careful. I'd suggest leaning towards Grafana and Prometheus instead if you're concerned about costs; they offer better control and a more rewarding setup process.
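To put rough numbers on the custom-metric math, here's a minimal back-of-the-envelope sketch. It uses the ~$0.30 per metric per month figure mentioned in this answer; the 25 metrics per pod is purely a made-up illustration, and the function name is hypothetical. It ignores volume-tiered pricing, API request charges, logs, and alarms, so treat it as a ballpark only and check the current pricing page for your region.

```python
def estimate_custom_metric_cost(clusters, pods_per_cluster, metrics_per_pod,
                                price_per_metric=0.30):
    """Rough monthly cost estimate for CloudWatch custom metrics.

    Assumes a flat price per metric per month (no tiering) and that every
    pod emits the same number of distinct metrics. Both are simplifications.
    """
    metric_count = clusters * pods_per_cluster * metrics_per_pod
    monthly_cost = metric_count * price_per_metric
    return metric_count, monthly_cost


if __name__ == "__main__":
    # 4 clusters and 20 pods each come from the question;
    # 25 metrics per pod is a hypothetical value for illustration.
    count, cost = estimate_custom_metric_cost(
        clusters=4, pods_per_cluster=20, metrics_per_pod=25
    )
    print(f"{count} metrics, roughly ${cost:,.2f} per month")
```

The thing to watch is `metrics_per_pod`: adding a high-cardinality dimension (pod name, request path, customer ID) multiplies the metric count, and the bill scales linearly with it.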
Honestly, running four clusters for just 15-20 pods each seems excessive. It might be worth consolidating those clusters to cut down on costs. If you stick with CloudWatch, definitely think about using Container Insights to get pod-level metrics. Just be prepared for the increase in metric volume and the higher bill that comes with it.
For a setup like yours, CloudWatch is likely easier to manage, but the bills will be something to watch out for. The cost of each enabled metric adds up quickly, especially if you're tracking usage per pod and per node. Managed services like Amazon Managed Service for Prometheus and Amazon Managed Grafana might be worth a look, since they provide out-of-the-box monitoring for EKS at a potentially reduced cost.
