DeepSeek R1: The $6M Model That’s Reshaping the Global AI Race

DeepSeek's R1 model is revolutionising AI with its open-source nature, cost-effectiveness, and impressive performance. This article explores its impact, technical comparisons with OpenAI, and the implications for censorship, data privacy, and the global AI race.

Author

Dylan Stewart

Writer

January 28, 2025

Introduction

DeepSeek, a Chinese artificial intelligence (AI) startup, has made headlines with the release of its R1 model, a development that has sent ripples through the global tech community. This article covers what DeepSeek R1 is, why it rose so quickly, how it compares technically with models from industry leaders such as OpenAI, the financial fallout from its release, its potential impact on the future of AI, and what it means for the AI race between the USA and China.

Understanding DeepSeek R1 and Its Meteoric Rise

DeepSeek R1 is an AI reasoning model developed by DeepSeek, a company founded in 2023. R1 uses "chain of thought" reasoning: it works through intermediate steps before giving a final answer, which improves the quality of its responses on multi-step problems. Unlike many proprietary models, R1 is open-sourced and free to download, in contrast to the subscription-based products offered by competitors such as OpenAI. This accessibility has contributed to its rapid adoption: DeepSeek's chatbot app, a ChatGPT competitor, became the most-downloaded app in Apple's App Store shortly after its release.
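To make the chain-of-thought idea concrete, here is a minimal Python sketch that separates a model's reasoning from its final answer. It assumes the raw output wraps the reasoning in `<think>...</think>` tags ahead of the answer; the `split_reasoning` helper and the sample text are hypothetical, written by hand for illustration rather than taken from DeepSeek's tooling.

```python
import re

# Assumption: the reasoning appears inside <think>...</think>, followed by the answer.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(response: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw model response."""
    match = THINK_BLOCK.search(response)
    if match is None:
        # No reasoning block found: treat the whole response as the answer.
        return "", response.strip()
    reasoning = match.group(1).strip()
    # Everything after the closing tag is treated as the final answer.
    answer = response[match.end():].strip()
    return reasoning, answer


# Hypothetical response text, not real model output.
sample = (
    "<think>17 is odd. 17 + 5 = 22, which is even.</think>\n"
    "The sum is 22, an even number."
)
reasoning, answer = split_reasoning(sample)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

An application could show only the answer to end users while logging the reasoning for review.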

DeepSeek's R1 model has drawn attention not only for its performance but also for how cheaply it was built. The company reportedly trained the model in approximately two months on a budget of around $5.6 million. That figure is generally understood to cover the compute for the final training run only, not research, staffing, or earlier experiments. Even so, it is far below the hundreds of millions or billions of dollars typically invested by other leading AI firms.

In terms of hardware, DeepSeek utilised about 2,000 Nvidia H800 chips to train the R1 model. These chips are designed to comply with U.S. export controls and are less capable than Nvidia's top-tier offerings. By comparison, OpenAI is estimated to have used approximately 25,000 Nvidia A100 chips to train models like GPT-4. The gap underscores DeepSeek's achievement in building a competitive AI model with significantly fewer resources.
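As a rough sanity check on these numbers, the sketch below multiplies the reported GPU count by the reported training time to get GPU-hours, then applies a rental price. The GPU count and duration are the approximate figures quoted above; the $2 per GPU-hour rate is an assumption introduced here for illustration, not a figure from this article.

```python
# Approximate figures quoted in the article.
GPU_COUNT = 2_000      # Nvidia H800 chips
TRAINING_DAYS = 60     # "approximately two months"

# Assumption for illustration only: rental price per H800 GPU-hour.
USD_PER_GPU_HOUR = 2.00


def training_cost(gpus: int, days: int, usd_per_gpu_hour: float) -> tuple[int, float]:
    """Return (total GPU-hours, estimated compute cost in USD)."""
    gpu_hours = gpus * days * 24
    return gpu_hours, gpu_hours * usd_per_gpu_hour


gpu_hours, cost = training_cost(GPU_COUNT, TRAINING_DAYS, USD_PER_GPU_HOUR)
print(f"{gpu_hours:,} GPU-hours, about ${cost / 1e6:.2f}M")
# prints: 2,880,000 GPU-hours, about $5.76M
```

Under these assumptions the estimate lands close to the reported budget of around $5.6 million, which suggests the headline number is mainly a compute-rental figure.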

DeepSeek's R1 model dramatically undercuts the costs of OpenAI's o1 models. Source: hackster.io

Technical Comparisons to Other Models

In benchmark tests, DeepSeek R1 has performed on par with, and in some cases ahead of, leading models from OpenAI. For instance, on mathematics benchmarks such as AIME 2024 and MATH-500, R1 has scored slightly ahead of OpenAI's o1-1217 model. By reaching these results with lower training costs and less advanced GPU chips, DeepSeek has challenged the prevailing notion that superior AI results require cutting-edge hardware and vast financial resources.

DeepSeek’s R1 outperforms other companies’ latest models on commonly used AI tests. Source: NBC News

Censorship and Data Privacy

Concerns have been raised about censorship in DeepSeek's R1 model, with some claiming that it is heavily censored compared to other AI models. Although R1 is open-sourced, which allows for greater transparency, it remains subject to regulatory controls, especially given its Chinese origins. Critics argue that, like OpenAI’s models, R1 may filter content to comply with local regulations and to prevent harmful or offensive outputs, but the specifics of its censorship mechanisms have not been fully disclosed.

On data privacy, DeepSeek's approach is less clear than that of OpenAI, which has a detailed privacy policy outlining how data is collected and used for training purposes. OpenAI typically uses data to fine-tune its models, while R1’s data usage policies remain somewhat opaque. Another key distinction is that the R1 models can be downloaded and run locally, giving users more control over their data and how it is processed, in contrast to OpenAI’s cloud-based services. Using R1 online requires adherence to DeepSeek’s terms, but running it locally may provide more flexibility and privacy, as users can manage data storage and processing without relying on an external server. Local deployment also shifts responsibility for security to the user and raises the risk of misuse if the model is not properly managed.

Financial Implications

The release of DeepSeek R1 has had significant financial repercussions. The model's state-of-the-art performance and cost efficiency triggered a sharp sell-off in AI-related tech stocks, with Nvidia losing a record $589 billion in market value in a single day. Some now speculate that the sector has been in a bubble and that many tech companies are significantly overvalued. The sell-off has prompted analysts and investors to reassess the valuations of AI-centric companies and to consider whether model training is about to become far more efficient.

The share price percentage fall of Nvidia compared with the S&P 500 and Nasdaq over the past four trading days. Source: Financial Times

Implications for the Future of AI

DeepSeek R1's emergence challenges the current AI development paradigm, suggesting that high performance can be achieved with more efficient and cost-effective approaches. This could democratise AI, making it more accessible to smaller players and reducing energy consumption. Because R1 is open-sourced and free, it lowers barriers to entry further, giving smaller companies and researchers access to advanced AI technology. This fosters innovation, collaboration, and the potential for rapid AI development globally, though it also raises concerns about who controls powerful AI technologies and the risk of misuse.

International Implications in the AI Race Between the USA and China

DeepSeek R1’s success has intensified competition in the global AI race, particularly between the U.S. and China. While some have called this an “AI Sputnik moment”, a wake-up call for the U.S. prompted by a Chinese breakthrough, others argue that Western firms still lead in general AI capabilities. That the model was developed despite U.S. export restrictions on advanced chips raises critical questions about how nations can maintain their technological edge in an increasingly competitive field. Policymakers are now grappling with the broader geopolitical competition between the U.S. and China in AI and technology.

Conclusion

DeepSeek R1 represents a significant milestone in AI development, with far-reaching technical, financial, and geopolitical implications. Its emergence challenges the assumption that leading AI models require the largest budgets and the most advanced chips, and it underscores how quickly the AI industry is evolving.