Recently, I was asked by a VP about a significant increase in our machine learning team's spending on Azure. I spent three days going through resource tags that were mostly empty, trying to make sense of subscription sprawl driven by differing naming conventions and confusing cost allocation rules. I eventually discovered that a developer had created a GPU cluster for testing and forgot about it, tagging it to multiple cost centers due to the absence of a proper tagging policy. However, I believe the underlying issue is the lack of transparency in Azure's cost attribution. It's really tough to trace expenses back to specific business units or applications without constructing a complex attribution layer ourselves. I'm curious to know how others in the community are managing this situation. There must be a more efficient way to handle it!
5 Answers
Honestly, proper tagging is essential for managing costs efficiently in Azure. You should be developing a clear tagging policy that everyone adheres to from the outset. Establishing budget alerts and restricting who can create subscriptions are key practices. Having a single ownership model for subscriptions and applying tags at deployment can drastically simplify your cost attribution woes!
It sounds like a lot of the challenges you're facing are due to your current governance setup rather than Azure itself. If your teams are accountable for their own subscriptions, they’ll be less likely to let costs spiral out of control. You might want to set clear restrictions on who can build resources where—not just at the subscription level but also within resource groups. That's how you can roll up costs easily. Also, having a director or manager oversee each subscription means they’ve got skin in the game to ensure proper use of resources.
Don’t underestimate the power of alerting systems. For newly created resource groups lacking proper tags, we send automatic alerts to the admins; they follow a set policy to address issues promptly. This also helps keep everyone accountable. Setting clear responsibilities around tags can prevent problems before they escalate. A lot of this comes down to creating a culture of compliance in your teams.
The problem seems to stem from a lack of proper governance rather than Azure's design. You should definitely be enforcing mandatory tagging through Azure Policy whenever resources are deployed, and implementing budget alerts for every subscription can also help. It’s really important to have the right role-based access controls in place. For your untagged GPU cluster issue, developing an attribution layer could help track ownership and spending effectively, plus it saves headaches in the long run. You don’t want to keep scrambling after the fact.
It seems there’s a fundamental issue with how costs are tracked in Azure. Building a more effective attribution layer is a good move. We use something like pointfive to help with tracking expenditures and tying them back to business units. The reality is that Azure doesn’t provide straightforward visibility into who’s spending what; you’ll need either external tools or a solid governance strategy in place.

Related Questions
Biggest Problem With Suno AI Audio
Ethernet Signal Loss Calculator
Sports Team Randomizer
10 Uses For An Old Smartphone
Midjourney Launches An Exciting New Feature for Their Image AI
ShortlyAI Review