Hey there! I'm currently managing a Docker Swarm cluster with around 13 nodes and roughly 300 services, and I'm in need of some solid tools to help me gather traces, logs, and metrics. I've already tried Uptrace and SigNoz, but they didn't meet my expectations due to stability issues for a system of this scale. Here's what I'm looking for in the solutions: they should be open-source, free to self-host, work seamlessly with Docker Swarm, support metrics, logs, and distributed traces, and be scalable and reliable enough for managing 300 services. Any recommendations?
3 Answers
Coroot is another great option to check out. It has some nice features for monitoring that might suit your needs well!
Have you considered using Vector along with Prometheus or VictoriaMetrics for metrics collection, paired with Grafana for visualization? It might be a solid choice for your setup. Traces are a bit trickier, especially if you want to use eBPF tools, but that's something to look into.
The classic stack of Prometheus, Loki, and Jaeger could work well for your requirements. While it does involve managing multiple components, Grafana has a tool called Alloy (formerly known as Agent) that can simplify the integration. Just a heads up: running monitoring for 300 services might introduce real costs in terms of infrastructure and the time needed to maintain everything, even if it's open-source and free.

Related Questions
Can't Load PhpMyadmin On After Server Update
Redirect www to non-www in Apache Conf
How To Check If Your SSL Cert Is SHA 1
Windows TrackPad Gestures