How to Efficiently Process Images Using Multiple Machines?

0
4
Asked By TechWiz47 On

Hey everyone, I need some guidance for a project I'm working on. I'm processing batches of images and want each machine in my cluster to serve as both the source and destination for these images. Here's the workflow: I receive a hard drive with images, copy them to a machine in my cluster, then an API processes these images. The paths of the original images are stored in PostgreSQL, and I use Celery/Redis for task management, with Keda for scaling.

3 Answers

Answered By DataDrivenDude On

I think I need a bit more detail about your requirements. How many images are you processing at once? What sizes are we talking about? If you're looking for high efficiency, you might consider using something like Argo Workflows. But I'd really want to know more about your current workflow before making any specific suggestions.

Answered By CuriousCoder98 On

It sounds like you're overthinking this a bit! Instead of worrying about which machine is the source or destination, why not just mount a shared storage solution? You can have your containers access the same storage, and no need for complicated routing between machines. Could you share more about your specific setup?

Answered By EfficiencyExpert On

If your goal is for each node to act as both a source and a destination, there might be a simpler approach. Why not just schedule the Celery tasks to run on the same node as the images? That way, you're minimizing network traffic and complexity. You could categorize the workers by node to streamline the task distribution without sending around IPs.

Related Questions

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.