Is Anyone Else Facing Silent Failures with ArgoCD and Crossplane?

0
3
Asked By TechieTurtle42 On

I've been on an absolute rollercoaster trying to troubleshoot an incredibly frustrating issue with ArgoCD and Crossplane. Here's what I'm facing: ArgoCD consistently indicates that resources are "Healthy" and "Synced," yet Crossplane is having major trouble provisioning AWS resources. We're receiving countless 400 errors from AWS, causing things like Lambda functions to not update and RDS instances to get stuck. It's like ArgoCD is serving as a false beacon of hope while everything else is crumbling.

I've spent a significant amount of time researching and I've found no documentation or discussion that addresses this issue directly. It's baffling! Through my investigation, I discovered that the health check logic in Crossplane is flawed. Conditions are processed in an array, leading ArgoCD to declare resources healthy when it's not necessarily true, as long as 'Ready: True' appears before any failing conditions.

Is anyone else dealing with this absurdity? Are we all just overlooking the health checks with Crossplane, or is my setup unusually cursed? I managed to circumvent the problem by rearranging the condition checks, but I'm shocked that this isn't better known.

If this strikes a chord with anyone, please let me know!

5 Answers

Answered By DevOpsDude On

You should really consider sending a patch instead of just working around the problem. Fixing the issue directly might prevent it from affecting others in the future.

Answered By CuriousCoder99 On

It's great that you found a workaround! But just a heads up, Medium articles that are "Member-only" can be frustrating. It might be better to share your findings somewhere more accessible, like GitHub.

UserFriendlyGuy88 -

Yeah, Medium can be a pain, especially with member restrictions.

RandomDev007 -

I generally avoid Medium too. It's likely going to limit who can read about your issue.

Answered By DevExpert21 On

It sounds like there's a bit of misunderstanding about how GitOps works with Argo. The resources may be synced, but that doesn't mean they're healthy. Argo's job is primarily to ensure that what's in your cluster matches the desired state, not to guarantee everything is completely operational. You should implement additional monitoring tools for a full health overview, like Grafana or Datadog, which can track the real-time state of your AWS resources.

TechieTurtle42 -

I see what you're saying! I think I focused too much on Argo's output and not enough on monitoring tools.

SystemWatcher22 -

Exactly! Relying solely on Argo for health checks could lead to these kinds of oversights.

Answered By ProblemSolver101 On

If you're dealing with these kinds of issues, perhaps consider filing a GitHub issue instead of just writing it up somewhere else. It seems like this could benefit many users without them realizing it.

TechieTurtle42 -

That’s a good suggestion! I did mention it to the maintainers, but it feels more like a community issue right now.

Answered By ThatWiseDude On

Honestly, I've seen folks hit this issue before. Fortunately, I had learned about Argo's health check behavior beforehand, so it didn't surprise me. I assumed anyone using Argo would know to test custom health checks when the defaults don't work as expected.

TechieTurtle42 -

Right? I feel like this info should be more common knowledge!

Related Questions

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.