‘A virtual DPU within a GPU’: Could clever hardware hack be behind DeepSeek’s groundbreaking AI efficiency?


  • A new approach called DualPipe seems to be the key to DeepSeek’s success
  • One expert describes it as an on-GPU virtual DPU that maximizes bandwidth efficiency
  • While DeepSeek has used Nvidia GPUs only, one wonders how AMD’s Instinct would fare

China’s DeepSeek AI chatbot has stunned the tech industry, representing a credible alternative to OpenAI’s ChatGPT at a fraction of the cost.

A recent paper revealed DeepSeek V3 was trained on a cluster of 2,048 Nvidia H800 GPUs – crippled versions of the H100 (we can only imagine how much more powerful it would be running on AMD Instinct accelerators!). Training reportedly required 2.79 million GPU-hours in total, covering pretraining on 14.8 trillion tokens and the fine-tuning that followed, and cost – according to calculations made by The Next Platform – a mere $5.58 million.
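
To put those numbers in perspective, here is a rough back-of-the-envelope sketch using only the figures quoted above. The function names are illustrative, and the wall-clock estimate assumes all 2,048 GPUs ran continuously with no downtime, which is a simplification rather than anything the paper states.

```python
# Figures quoted in the article.
GPU_COUNT = 2_048
GPU_HOURS = 2_790_000
TOTAL_COST_USD = 5_580_000


def implied_rate_per_gpu_hour(total_cost_usd, gpu_hours):
    """Rental price per GPU-hour implied by the cost estimate."""
    return total_cost_usd / gpu_hours


def wall_clock_days(gpu_hours, gpu_count):
    """Elapsed days, assuming every GPU is busy for the entire run."""
    return gpu_hours / gpu_count / 24


rate = implied_rate_per_gpu_hour(TOTAL_COST_USD, GPU_HOURS)
days = wall_clock_days(GPU_HOURS, GPU_COUNT)

print(f"Implied rental rate: ${rate:.2f} per GPU-hour")
print(f"Approximate wall-clock time: {days:.1f} days")
```

On these assumptions the cost estimate works out to roughly $2 per GPU-hour, and the run to a little under two months of cluster time.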
