I'm really curious about how to handle the content structure on sites with more than 200 pages, especially from an architectural perspective. On content-rich platforms like documentation sites, blogs, or resource hubs, I've noticed that things tend to get disorganized as more pages are added. Common issues arise such as overlapping routes that target similar intents, thin or redundant tag systems, related content components that auto-link randomly, and an overall unclear page hierarchy.
From a web development standpoint, do you enforce a structure at the routing level, or is it generally the responsibility of the CMS or editorial team? I'm particularly interested in how to prevent long-term structural drift, whether you conduct audits of internal link graphs regularly, and if you treat content architecture like traditional system design or let it develop organically over time. I'd love to hear how others approach these challenges!
3 Answers
I’m interested in this too! Did you notice this drift pattern firsthand? Sometimes overlapping intents can be intentional if the website's goals include SEO or increasing reach. If it's unintentional, then it might be more of a language or taxonomy issue, which can be tricky to manage since language can get messy.
I believe that once a site gets larger, the content structure needs some governance, just like any other system. The drift often comes from flexible taxonomies and unchecked tag growth. Regular internal link audits and having clear routing constraints can really help prevent that chaos. Organic growth is great initially, but not sustainable in the long run.
Honestly, I think the best approach is to create the templates that content creators need. After all, content is primarily for the copywriters. But I also worry about where the structural guardrails lie. If you allow for unlimited tagging, cross-linking, or loosely defined hierarchies, then drift seems like it's going to happen no matter what.

That’s exactly my concern too! If there's too much freedom with tagging and linking, it could easily spiral out of control. Do you think it should be purely editorial, or should the system impose some constraints?