The insane engineering of Deepseek V4

Tech

Deepseek V4 explained. #ai #aitools #ainews #llm #agi #deepseek #claude

Thanks to our sponsor Abacus AI. Try ChatLLM & DeepAgent today: http://chatllm.abacus.ai/?token=aisearch

Deepseek v4: https://api-docs.deepseek.com/news/news260424
LLMs explained: https://youtu.be/U2hZFMVNSE0
Residual connections: https://youtu.be/2IfAVV7ewO0

0:00 Deepseek V4 intro
1:00 Deepseek V4 specs
2:06 The challenge of 1M context
4:16 Hybrid attention
5:11 CSA & sparse selection
6:50 HCA
8:22 Sliding window attention
10:44 Insane efficiency gains
12:02 Signal explosion
13:00 Residual connections
13:52 mHC
14:17 ChatLLM
15:24 mHC continued
17:54 Muon
19:26 Infra challenges
22:31 Training challenges
24:09 Anticipatory routing
25:24 SOTA results

Newsletter: https://aisearch.substack.com/
Find AI tools & jobs: https://ai-search.io/
Support: https://ko-fi.com/aisearch

Here's my equipment, in case you're wondering:
Lenovo Thinkbook: https://amzn.to/4jWeKwH
Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0
GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS
Mic: Shure SM7B https://amzn.to/3DErjt1
Audio interface: Scarlett Solo https://amzn.to/3qELMeu

Comments 100

dagmarcascade31 1 month ago

Winning the game even while being crippled by unfair tactics. Well done to the team. Deepseek is a blessing to the world.

maríaluisa_lemus 1 month ago

Another game-changer for a week.

alix_legendre 1 month ago

Simply put, compared to other AI models, v4 is more like taking sheets of paper filled with information, binding them into books, and placing the books on a bookshelf. That way, when a person needs earlier information, they can quickly find the right bookshelf, locate the book, and then find the desired content.

victoria.solano 1 month ago

Great explanation!

hans-heinrich.segebahn 1 month ago

Very insightful and interesting

gaelhenrique_farias 1 month ago

Awesome explanation! Awesome work by the DeepSeek team. What a gold mine this is. Revealing all the infra stuff is crazy. The fix for the loss spike alone could easily save millions or billions of dollars worldwide :D

tristan.miller 1 month ago

Thank you for this video and analysis AI Search, I learn a lot from your videos and know of other sources to use.

thomastempest62 1 month ago

Great breakdown!

vasudhamalhotra949 1 month ago

Awesome video, thanks for your unrelenting work on AI. Little hype and a proper analysis, I like it!

cynthia.horn 1 month ago

The most unbelievable thing is that this is just one of their company's side businesses, and it is never capital that drives technology, but genius engineers.

jonathan.hill 1 month ago

Thank you for the great content!

amanda.knight 1 month ago

I'm paying for Claude and Gemini, but lately DeepSeek is my preferred model for symbolic calculations and astrophysics coding. Where Gemini and Claude struggle, DeepSeek nailed it on the first prompt. I'm genuinely impressed.

benitosolorzano76 1 month ago

Man so glad you listened, these deep overviews into the tech are absolutely fascinating.

jacqueline.long 1 month ago

This is the best video I have watched explaining how this works in plain language and I am impressed in a day and age where hardly anything is clearly discussed.

matthewpalmer720 1 month ago

China has been dumping products, now it's dumping AI tech. I LIKE IT

hakanhenk747 1 month ago

Extremely clever. Thanks for explaining it. I wonder why they didn't also release a smaller version of the model.

rael.novaes 1 month ago

Deepseek is the true OPEN AI, while OpenAI is more closed AI

marguerite_robin 1 month ago

Some people understand stuff really well; others understand it and explain it well. You're amazing, thanks.

ashleyjames985 1 month ago

My favorite AI model: Deepseek!

frédérique_weiss 1 month ago

Amazing explanations as usual, thanks!