I'm trying to find some straightforward resources for understanding how machine learning (ML) and large language models (LLMs) work, without the over-the-top hype or gloomy predictions about AI. There are folks around me who enthusiastically talk about AI but don't really grasp how it operates because they've never used it. They often treat tools like ChatGPT as if they're flawless, which isn't the case. I'm searching for a short video or a website that clearly explains the basics of LLMs and ML without the corporate jargon or sarcastic critiques. It's hard to sift through the multitude of AI explainers online, and I want something trustworthy that covers both the strengths and weaknesses of these technologies. I did like a certain video I found, but it felt a bit outdated at times.
5 Answers
Machine learning is pretty mathematical, so diving deeper into the math can lead you down a rabbit hole. The video you liked already gives a great high-level overview! If you want to get more technical, you could explore concepts like PageRank or Word2Vec, which are foundational and sort of carry you towards understanding LLMs better. Just keep in mind that going too deep might require some patience since it can get a bit dry.
Honestly, even just chatting with ChatGPT can provide some insights, albeit with a bit of irony! It’s interesting to explore how these AIs respond, even if they aren’t perfect.
I think 3Blue1Brown has some great explanations on machine learning. They break things down really well, and this (https://www.youtube.com/watch?v=aircAruvnKk&list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi) is particularly useful. If you're curious about how LLMs function, their chapters on this are super insightful, especially chapters 5 and 6!
You might want to check out this video [here](https://www.youtube.com/watch?v=7xTGNNLPyMI). The creator co-founded OpenAI, so it's pretty informative about LLMs, although it’s a bit long at three hours. It's worth it if you're looking for an in-depth explanation.
You should definitely check out Welch Labs; they have an amazing video series on modern machine learning. Their visuals are super clear and really help explain why things work the way they do, similar to what 3Blue1Brown does for math. They keep producing content, so there's always something new to learn from them. You can find their channel [here](https://youtube.com/@welchlabs).

Totally agree! Those chapters really clarify a lot about how LLMs operate under the surface, even if it can be complex.