Applications

Best Way to Search Within PDFs Using S3 Vector Store?

August 9, 2025

Asked By CuriousReader42 On August 9, 2025

I'm looking to set up a system for searching through less than 500 PDF files, primarily journal articles. The goal is to have a search capability that can handle queries like, "What articles discuss frog habitats in North America?" Adding new PDFs will be rare—maybe just a few each month—and I expect only a couple of queries per day. I'm considering the S3 vector store for this purpose, but I've heard that Kendra can be quite expensive even for low usage. Is using a vector store a good option for my needs? I'm open to suggestions for an effective method.

1 Answer

Answered By TechSavvyJoe On August 11, 2025

I'm not sure if the S3 vector store supports natural language retrieval. I would suggest using Textract to extract text from your PDFs and then leverage Bedrock's capabilities to query that data. The only costs would come from the initial text conversion and then very minimal charges based on the tokens used during queries.

KnowledgeSeeker88 - August 11, 2025

Would using vector stores be an option for simple keyword searches? For instance, if a user searches for 'eardrum', could it return all PDFs containing that word? They're willing to adjust the functionality to keep costs reasonable.

Best Way to Search Within PDFs Using S3 Vector Store?

1 Answer

Related Questions

Fix Not Being Able To Add New Categories With Intuitive Category Checklist For Wordpress

Get Real User IP Without Installing Cloudflare Apache Module

How to Get Total Line Count In Visual Studio 2013 Without Addons

Install and Configure PhpMyAdmin on Centos 7

How To Setup PostfixAdmin With Dovecot and Postfix Virtual Mailbox

Dovecot Error Unknown database driver mysql

LEAVE A REPLY Cancel reply