How can I set up a development environment for junior engineers using Kafka, Spark, and S3?

0
8
Asked By CuriousCoder42 On

I'm looking to create a reproducible development environment for about seven junior engineers, focusing on technologies like Kafka, Spark, S3, and data pipelines. The setup needs to work both locally (offline) and on AWS. As I don't have much software engineering experience, I'd appreciate tips or resources on whether using Docker Compose with separate EC2 instances for each engineer is the right approach for isolation. Also, should I incorporate Terraform into this setup or is there a simpler way?

2 Answers

Answered By TechieTom On

It really depends on what you’re trying to achieve. I'm a bit puzzled as to why everyone needs Kafka and Spark if no one is familiar with the dev environment setup. Are you just trying to get everything up and running before they start?

Answered By DevDude99 On

You're on the right track! A Docker Compose setup with Kafka, Spark, and MinIO (which is S3-compatible) is a solid choice. Giving each junior engineer their own small EC2 instance is wise—it keeps things isolated so one broken pipeline won't affect the whole team. You can have them use MinIO locally and switch to real S3 through environment variables when they're on AWS. For now, I'd advise skipping Terraform. Just use a launch template with a user-data script to install Docker; that should suffice for your seven instances. Pre-baking an AMI with Docker can save you from repeating instructions.

Related Questions

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.