‘A virtual DPU within a GPU’: Could a smart hardware hack be behind DeepSeek’s groundbreaking AI efficiency?


  • A new approach called DualPipe appears to be the key to DeepSeek’s success
  • An expert describes it as a virtual DPU on the GPU that maximizes bandwidth efficiency
  • While DeepSeek has only used Nvidia GPUs, one wonders how AMD’s Instinct would fare

China’s DeepSeek AI chatbot has stunned the tech industry, representing a credible alternative to OpenAI’s ChatGPT at a fraction of the cost.

A recent article revealed DeepSeek V3 was trained on a cluster of 2,048 Nvidia H800 GPUs, cut-down versions of the H100 (we can only imagine how much more powerful it would be running on AMD Instinct accelerators!). It reportedly required 2.79 million GPU hours for pre-training and fine-tuning on 14.8 trillion tokens and cost, according to calculations made by The Next Platform, only $5.58 million.
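As a rough sanity check on those figures, the short Python sketch below derives the per-GPU-hour rate implied by the reported totals and the approximate wall-clock duration of the run. It rests on two assumptions that the article does not state: that a single flat rate applies to every GPU hour, and that all 2,048 GPUs ran concurrently for the whole job. The function names are illustrative only.

```python
# Figures as reported in the article.
GPU_HOURS = 2.79e6        # total GPU hours for pre-training and fine-tuning
TOTAL_COST_USD = 5.58e6   # cost estimate attributed to The Next Platform
NUM_GPUS = 2048           # Nvidia H800 GPUs in the cluster


def implied_rate_per_gpu_hour(total_cost_usd: float, gpu_hours: float) -> float:
    """Flat hourly rate implied by the totals (assumption: one uniform rate)."""
    return total_cost_usd / gpu_hours


def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Elapsed days, assuming every GPU is busy for the entire run."""
    return gpu_hours / num_gpus / 24


rate = implied_rate_per_gpu_hour(TOTAL_COST_USD, GPU_HOURS)
days = wall_clock_days(GPU_HOURS, NUM_GPUS)

print(f"Implied rate: ${rate:.2f} per GPU hour")
print(f"Approximate wall-clock time: {days:.0f} days")
```

Under those assumptions, the reported totals work out to about $2 per GPU hour and a run of a little under two months.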
