I'm seeking some advice on enterprise-level search indexing. I have a client with a massive file server that houses around 14 million files, organized via multiple shares. While the Windows Search Service is running and claims to have indexed everything, the search functionality isn't working as expected. The index file is over 1TB, and I understand that it's not optimal for indexing more than a million files. Additionally, the index is currently on a HDD RAID setup, not SSD.
The client primarily uses Mac systems and is accustomed to the Spotlight search functionality. They are open to spending money to enhance search capabilities for their file server shares (which are accessed via SMB3 from both Macs and PCs).
I've been searching for an effective solution online but haven't found anything promising. I'm hesitating to invest in SSDs for improving index response time since Windows Search isn't recommended for this number of files anyway. It would be great to find a product that can deliver Spotlight-quality search results for such large data sets stored on an on-premises file server. The client is flexible and willing to explore new hardware, operating systems, or software to achieve the desired search experience. Any recommendations?
5 Answers
If you're considering a broader shift, think about migrating the data to OneDrive or Teams to eliminate on-prem infrastructure altogether. It could simplify things, but I understand you might want to keep things local.
Consider investing in a proper document management system. They can scale effectively and usually address more than just search issues — they're designed for comprehensive data management.
Installing SSDs could be your fastest and most cost-effective solution here. If most of your files are documents, document management systems often provide highly optimized search features as well.
You might want to check out the 4ig system. It's on the pricier side, but it could provide the functionality you're looking for. Here's the link: infinnium.com/products/4ig.
Thanks for the suggestion, I'll look into it!
Mylex offers a solution that could integrate well with your existing data sources and takes file permissions into account when presenting search results.
I'll reach out to them. Thanks for the recommendation! Are you currently using their services?

That's definitely not an option for us. We need to keep everything on-prem. OneDrive for this volume of data could become a hassle.