We're dealing with some significant challenges related to configuration drift across our various environments, especially since multiple teams are making deployments. Keeping everything in sync and compliant with our standards has become quite a task. I'm looking for effective strategies and tools you use to manage this issue. What proactive or reactive approaches have worked for you?
5 Answers
Some folks argue that Kubernetes is overkill for most companies, but it really addresses these types of issues effectively. Tools like Argo or Flux can keep everything synchronized, automatically reverting any changes that aren't approved. For example, if a developer adjusts a config map, Argo can revert it back within 30 seconds! There are other tools like Chef, Ansible, or Salt, but those require you to run them on a schedule to keep things in sync.
We primarily use source control for changes, which simplifies management since I'm often the only one making updates. It also helps to manage tools like Puppet in source control to keep everything organized.
The key for us is having a centralized automated pipeline to deploy any changes triggered by a Git push. No one gets admin access unless it's an emergency, which keeps drift at bay. Plus, we have daily GitHub Actions that run with a Terraform plan to monitor for any unintended changes, helping us catch any issues early.
We run a daily Ansible job that helps align everything efficiently back to our desired states. It's a straightforward approach that keeps our configurations under control.
If you're using Kubernetes, the best strategy is to implement strict GitOps with Argo or Flux and enable autosync for anything in the cluster. It's crucial that only the ops team has direct access to the cluster to minimize drift. For broader infrastructure with multiple components, although Terraform is theoretically a great solution, in practice it’s tough to keep everything aligned. Small changes, like shifts in provider APIs, often lead to drift.

Related Questions
Can't Load PhpMyadmin On After Server Update
Redirect www to non-www in Apache Conf
How To Check If Your SSL Cert Is SHA 1
Windows TrackPad Gestures