Deepseek V4 explained. #ai #aitools #ainews #llm #agi #deepseek #claude

Thanks to our sponsor Abacus AI. Try ChatLLM & DeepAgent today: http://chatllm.abacus.ai/?token=aisearch

Deepseek v4: https://api-docs.deepseek.com/news/news260424
LLMs explained: https://youtu.be/U2hZFMVNSE0
Residual connections: https://youtu.be/2IfAVV7ewO0

0:00 Deepseek V4 intro
1:00 Deepseek V4 specs
2:06 The challenge of 1M context
4:16 Hybrid attention
5:11 CSA & sparse selection
6:50 HCA
8:22 Sliding window attention
10:44 Insane efficiency gains
12:02 Signal explosion
13:00 Residual connections
13:52 mHC
14:17 ChatLLM
15:24 mHC continued
17:54 Muon
19:26 Infra challenges
22:31 Training challenges
24:09 Anticipatory routing
25:24 SOTA results

Newsletter: https://aisearch.substack.com/
Find AI tools & jobs: https://ai-search.io/
Support: https://ko-fi.com/aisearch

Here's my equipment, in case you're wondering:
Lenovo Thinkbook: https://amzn.to/4jWeKwH
Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0
GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS
Mic: Shure SM7B https://amzn.to/3DErjt1
Audio interface: Scarlett Solo https://amzn.to/3qELMeu
Winning the game even while being crippled by unfair tactics. Well done to the team. Deepseek is a blessing to the world
Another game-changer for a week.
Simply put, compared to other AI models, v4 is more like organizing pieces of paper that were originally filled with information into books, and then placing the books on a bookshelf. In this way, when a person needs to access previous information, it can quickly find the corresponding bookshelf, locate the book, and then find the desired content.
Great explanation!
Very insightful and interesting
Awesome explanation! Awesome work by the DeepSeek team. What a gold mine this is. Revealing all the infra stuff is crazy, especially the fix for the loss spike could easily save millions or billions of dollars worldwide :D
Thank you for this video and analysis AI Search, I learn a lot from your videos and know of other sources to use.
Great breakdown!
Awesome video, thx for your unrelenting work on AI; little hype and a proper analysis, I like it!
The most unbelievable thing is that this is just one of their company's side businesses, and it is never capital that drives technology, but genius engineers.
Thank you for the great content!
I'm paying for Claude and Gemini, but lately DeepSeek is my preferred model for symbolic calculations and astrophysics coding. Where Gemini and Claude struggle, DeepSeek managed to nail it on the first prompt. I'm genuinely impressed.
Man so glad you listened, these deep overviews into the tech are absolutely fascinating.
This is the best video I have watched explaining how this works in plain language and I am impressed in a day and age where hardly anything is clearly discussed.
China has been dumping products, now it's dumping AI tech. I LIKE IT
Extremely clever. Thanks for explaining it. I wonder why they didn't also release a smaller version of the model
Deepseek is the true OPEN AI, while OpenAI is more closed AI
some people understand stuff really well, others understand it and explain it well. ur amazing, thanks
my favorite ai model---deepseek!
Amazing explanations as usual, thanks!