DeepSeek V4 AI Model Uses Chinese Chips, Boosts Open Source

DeepSeek's new V4 AI model is now using Chinese chips from Huawei, a change from Nvidia. This shows China's growing power in AI.

Hangzhou’s AI firm, DeepSeek, has launched its latest iteration, V4, signaling a persistent drive to influence the global open-source artificial intelligence landscape. The new model reportedly boasts enhanced reasoning and autonomous agentic capabilities, capable of tasks like code generation. It also features improved efficiency in handling larger data sets, a critical factor in the expanding scope of AI applications.

The V4 model’s development notably shifts away from previous reliance on Nvidia hardware, instead leveraging domestic Chinese chips from Huawei and Cambricon. This strategic pivot underscores China's ambition to build a self-sufficient AI ecosystem. While DeepSeek’s prior models, like R1, garnered attention for their performance and cost-effectiveness, the V4 aims to consolidate and advance this position.

PERFORMANCE CLAIMS AND GLOBAL COMPETITION

DeepSeek asserts that V4 surpasses existing open models in general world knowledge, though it concedes the model still lags behind industry giants such as Google's Gemini. The company’s proactive release strategy, exemplified by the quiet launch of V3.1 via WeChat and Hugging Face in the past, highlights a pragmatic approach to dissemination.

The company's trajectory, beginning with the R1 model’s surprising impact on the open-source AI scene, has already catalyzed broader adoption of open-source and open-weight models within China’s tech sector. This trend is expected to intensify competition, potentially prompting further scrutiny and risk assessments from international bodies and corporations regarding the use of Chinese AI technologies.

Read More: Big Tech Cuts Jobs While Spending Billions on AI

The competitive landscape is not confined to DeepSeek alone. China’s AI advancement includes other significant players such as Alibaba's Qwen, Moonshot AI's Kimi, and Baidu's Ernie, all contributing to a dynamic national effort.

BACKGROUND AND INDUSTRY DYNAMICS

DeepSeek’s emergence, particularly with its R1 model, demonstrated that high-performance AI could be achieved at a lower cost and with fewer resources compared to established Western competitors. This disrupted expectations and fueled comparisons, with some American firms integrating DeepSeek's models into their own applications.

The open-source AI community, where DeepSeek operates, fosters collaboration and rapid development. However, it also presents challenges. Reports from firms like Exiger and CrowdStrike have previously flagged potential security concerns and alleged military ties associated with Chinese AI companies, a recurring theme that may continue to shape discussions around models like V4.

Read More: Developer Trains 235 Million Parameter LLM From Scratch

The development and release of V4 come approximately a year after DeepSeek's initial foray into the global market, a period marked by significant technological strides and heightened geopolitical tensions in the AI domain.

Frequently Asked Questions

Q: What is DeepSeek's new V4 AI model?
DeepSeek has launched its V4 AI model, which is designed to improve reasoning and run tasks like coding. It can also handle more data.
Q: What hardware does the DeepSeek V4 model use?
The V4 model uses Chinese chips from Huawei and Cambricon, moving away from Nvidia hardware. This helps China build its own AI technology.
Q: How does DeepSeek V4 compare to other AI models?
DeepSeek claims V4 is better than other open models in general knowledge but still not as good as Google's Gemini. They released V3.1 quietly before.
Q: Why is DeepSeek's V4 important for China's AI?
This release shows China's aim to be self-sufficient in AI. It also increases competition with global AI leaders and may lead to more checks on Chinese AI tech.