Looking for AWS Architecture and Cost Advice for My Language Learning App

0
19
Asked By TechWizard87 On

Hello everyone,

I'm in the process of developing a mobile app for **Japanese conversation practice**. Here's a quick rundown of my tech stack:

- **Frontend:** React Native / Flutter
- **Backend:** Django
- **AI APIs:** Speech-to-Text → LLM reply → Text-to-Speech (using ChatGPT or Gemini)

**App Flow:**
The user speaks, which goes through the Django API for transcription, followed by getting a reply from the AI, and finally returning an audio response to the user.

**Requirements:**
- Approximately 1000 concurrent users
- High API concurrency handling
- Seeking a **cost-effective AWS setup**

**I'm looking for advice on:**
- What AWS architecture would be best (e.g., EC2, ECS, Lambda, etc.)?
- Tips on managing concurrent audio processing
- A rough estimate of **monthly costs**
- Common pitfalls to avoid when setting up this system

Any insights or real-world experiences would be greatly appreciated!

4 Answers

Answered By CloudGuru42 On

You should definitely consider a serverless approach. Running Django on Lambda can handle a lot of concurrent users efficiently. For the speech-to-text part, Amazon Transcribe is great; they have a streaming API that costs about 2 cents per minute. You can send the stream to Amazon Bedrock and use models like Nova for translation, which keeps costs down compared to using external APIs. This setup can potentially cost around $200 a month with 1000 users making about 100 requests daily. Also, using WebSockets for connections can reduce latency compared to REST calls.

Answered By FirstTimer01 On

Starting out, try to focus on just a few users at first instead of aiming for 1000 concurrent users right away. Build your architecture for flexibility and agility, which will allow you to make necessary changes as you gather feedback and iterate on your app. Find that product-market fit before scaling too aggressively!

Answered By AI_Stranger On

A good point is to maybe simplify your workflow. If you plan to use an OpenAI model, why not just stick with their real-time API? It could make things less complex by directly converting speech to text and generating the audio response, skipping a lot of the back and forth. Just a thought!

Answered By BudgetCoder99 On

When estimating costs, just remember that while you can keep things cheap, the AI service subscriptions can add up quickly. Make sure to account for all those variables and keep an eye on your hardware needs, as they’ll affect your monthly expenses, but setup can typically be around 10-15k for a solid production environment.

Related Questions

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.