How GPT and other large language models (LLMs) work. Transformers explained. How AI works. #ai #llm #machinelearning #datascience #transformer Thanks to our sponsor Genspark. Try it for free https://bit.ly/4uM3PLS Attention is all you need https://arxiv.org/html/1706.03762v7 0:00 Intro 0:33 The transformer model 1:30 Predicting the next word 2:30 Tokenization 5:06 Representing meaning 7:17 Positional encoding 9:17 Attention head 14:49 Genspark 16:35 Multiple heads 19:30 Add and norm 21:45 Feed forward neural net 24:08 Multiple decoder blocks 24:50 Final layer 27:03 Training the model Newsletter: https://aisearch.substack.com/ Find AI tools & jobs: https://ai-search.io/ Support: https://ko-fi.com/aisearch Here's my equipment, in case you're wondering: Lenovo Thinkbook: https://amzn.to/4jWeKwH Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0 GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS Mic: Shure SM7B https://amzn.to/3DErjt1 Audio interface: Scarlett Solo https://amzn.to/3qELMeu
ADVERTISEMENT
This is genuinely the best explanation of A.I and transformers that i have ever seen, keep up the good work!
I already knew basically how it works, but you did an amazing job explaining it.
This has to be the best explanation video. Great work 👍
Thank you so much!
God tier level explaining
This was an absolutely stellar presentation on AI.
About time I understood transformers deeply. thanks a lot for the video.
really great information well-presented, with the exception of all the little meme animated GIFs of cats and cartoon characters. Those little animations detract from, rather than add to, the effectiveness of the presentation.
One of the finest video on Transformers. 👌
After 40+ years of IT experience (including programming) I find it still unbelievable that anything sensible and/or meaningful can come out of this process. But many thanks for explaining this madness.
This video is so well done and so far ahead its insane. Keep it up with these educational video!
Honestly bro I’m 22 and I can see ai taking over the world and we’re honestly blessed to have YouTubers like you to educate us
Alot of math but good video. We now can have a more clear picture how transformer model works.
I am a neuroscientist researcher and I LOOOOVEEEE your videos!!! Thank you so much!!! You explain the concept in a beautiful way!!! I study the biological neuronal network and how they communicate using electrophysiology single cell, and field recording plus using imaging techniques to record be neuronal population firing and integrating signals at the de dri tic level. I see so many analogies and I started going deeper into the research and reading how biological neuronal network could inspire the ANN!
Thank you .. was nicely presented
This video has some serious explanation of Transformer architecture. Great work. Congratulations.
Thanks for making this video. Good stuff.
Incredible description of the LLM core technology. Congratulations.
This has got to be the best introduction into the transformer architecture. I've watched so many videos trying to explain it and they all pale in comparison. Not too high level, not too low level, good shit. Thanks man!
Was well worth it, thanks for explaining beautifully... I've used some similar examples but this is nice to have to intro LLMs to more people :)