I'm working on developing a local AI system that can handle tasks on my computer, both simple like opening Spotify and more complex tasks like downloading an image and setting it as my wallpaper. The idea is to control everything through voice commands or text inputs in an app, and I want everything to function locally. I'm in the process of creating a minimum viable product but keep hitting roadblocks. Is this concept feasible, or am I aiming too high?
6 Answers
Microsoft's been trying to create something like this for ages with tools like Clippy, and it hasn't quite worked out. It's a huge scope of work, especially when it comes to managing the conditional logic for different tasks. Just executing commands can lead to issues like unintentional file deletions if it’s not handled carefully.
Building something like J.A.R.V.I.S. is definitely a daunting task. While it's theoretically possible, doing it solo is pretty unrealistic. You'd likely need a solid team of developers working on it for several months, if not longer. It's not just about coding a few commands; you'd need to coordinate many different systems and ensure they work together smoothly.
It's possible, but it's a massive undertaking! You'll need things like a speech-to-text engine for processing audio commands and a way to execute those commands. You might think about using a local AI for the command logic, but keep in mind that the implementation can be tricky. Plus, there are security concerns like prompt injections to think about. It’s a lot of work, which is why you mostly see big companies making this kind of technology.
You might want to check out Clawdbot! It’s designed for the kind of functionality you’re looking for, and it might save you some time in development.
Honestly, the concept sounds like it’s edging close to what we’d call agentic AI. Just remember, you don't want the AI to have full control over your system due to risks of accidental deletions. Maybe limit some permissions until you can ensure it's safe!
If you seriously want to run this locally, be prepared for some hefty hardware requirements. A couple of RTX 3090s might be necessary to handle the load, especially if you want smoother performance. Just be ready for high energy costs, too!

Definitely agree! You could speed up the process using something like N8n to automate parts of it, and yes, running everything on-premises should mitigate a lot of risks.