Last week, the relatively unknown Chinese AI company DeepSeek shook the tech world when it launched an AI model that rivals the performance of OpenAI’s models. With a fraction of the typical cost and using far less powerful hardware than its competitors, DeepSeek is already challenging the status quo of AI development and raising questions about the future of global tech supremacy.
The Chinese company’s standout moment came on January 20 with the launch of its R1 reasoning model, which, the company claims, rivals OpenAI’s o1 model. More surprising than the model’s capabilities is its low training cost—around $5 million, a price tag that experts are now reassessing in the context of GPU power and model development.
DeepSeek’s claims that it achieved such impressive results with relatively inexpensive hardware has raised eyebrows and spurred debate across the tech world. Top AI engineers in the US have commended DeepSeek’s research for showing clever ways to build AI with fewer chips.
The startup’s engineers found a more efficient way to analyze data using these chips. AI systems learn by finding patterns in large amounts of data, like text, images, and sounds. DeepSeek used a method called “mixture of experts,” spreading the analysis across several specialized models, while reducing the time spent moving data around.
Prominent venture capitalist Marc Andreessen, called DeepSeek’s success “AI’s Sputnik moment,” referring to the shockwaves caused by the Soviet Union’s launch of the first satellite in 1957. With US stocks, particularly those of Nvidia, tumbling following the news, many are questioning whether American companies are still on top in the AI race.
In the wake of DeepSeek’s rise, comments from US political and tech figures, including President Donald Trump and OpenAI co-founder Sam Altman, have added fuel to the fire. Stocks tied to the AI sector saw some of their steepest declines in years, and the tremors were felt as far as Silicon Valley. Nvidia, the semiconductor giant that powers much of the AI industry, saw its largest single-day loss ever, prompting many to re-evaluate the status quo.
Innovative training methods and cost efficiency
One of the most talked-about aspects of DeepSeek’s breakthrough is its approach to training models. Despite using what many would consider low-end hardware, including the H800 GPUs—far less powerful than the H100s used by industry leaders—the company has made strides in maximizing efficiency. Experts like Marin Smiljanic, CEO of AI startup Omnisearch, point out that DeepSeek’s use of distillation and other optimizations allowed them to sidestep the usual reliance on high-end processing power.
Marin Smiljanic
“It’s definitely too early to call DeepSeek the established leader in reasoning models, since OpenAI’s o3 is more powerful, but the ~$5M training cost is insanely good and they’re definitely the leader in efficiency,” commented Smiljanic, highlighting the significance of DeepSeek’s success in terms of cost-effectiveness.
This efficient approach challenges conventional wisdom about the massive GPU requirements needed for efficient AI training. As the industry grapples with DeepSeek’s unexpected success, it’s clear that other companies may need to reassess their own strategies—particularly as demand for GPUs is expected to surge.
“OpenAI and Anthropic are probably in for a rough couple of months, as is NVIDIA. There is, though, a bull case for NVIDIA with many smaller LLM startups generating more demand for their chips. The rest of Big Tech will likely be fine,” Smiljanic explains.
What does this mean for the future of AI?
However, despite the buzz around DeepSeek, it is too early to declare it the undisputed leader in reasoning models. Machine learning engineer and tech entrepreneur Aleksa Gordic acknowledged that while DeepSeek’s models are impressive, OpenAI’s o3 is still more powerful.
Aleksa Gordic
However, DeepSeek’s models, especially the open-source DeepSeek-R1, are already making waves in the open-source AI community, with many praising the company’s willingness to release its models for public use under an MIT license.
“The Zero model achieves amazing reasoning capabilities,” Gordic remarked, adding that DeepSeek’s multi-stage training process, which includes reinforcement learning and supervised fine-tuning, has led to the emergence of advanced capabilities in their models, such as reflection and exploration of alternatives. “I definitely didn’t expect Chinese to be leading in the open-source AI,” Gordic adds.
Yet, as Martin Vechev, ETH Zurich professor and founder of Bulgaria-based INSAIT (Institute for Computer Science, Artificial Intelligence and Technology) points out, DeepSeek’s models are still specialized, with R1 excelling in reasoning tasks but not necessarily in multilingual applications.
“The DS series of models from China have been public for years. They are developed by strong researchers and engineers, who publish what they do in conferences and are constantly improvе the models, making them public, etc,” Vechev says.
“I expect we will see more benchmarks where R1/O1 do not work great. Then Google/OpenAI will release models that do well, then someone with access to more GPUs will still make an open version that is similar quality (DS or someone else). However, to make such a version, one will need more compute than $5M and closer to $50-$100M even for more specialized models,” the Bulgarian scientist explains.
Martin Vechev
Global implications and U.S.-China tech rivalry
Beyond the technical details, DeepSeek’s rise raises geopolitical questions. With tensions between the U.S. and China already at a high, DeepSeek’s rapid ascent is likely to fuel further scrutiny. Concerns about the potential risks of sending data to China are already circulating, though experts like Smiljanic argue that these fears are often overstated.
“Concerns around shipping data to China are overblown,” Smiljanic noted, pointing out that DeepSeek’s models are open-weights and can be run locally, much like other models from companies like Meta.
Still, the international implications are undeniable. As AI continues to become an increasingly important part of national security and global competition, the race to develop the most powerful, cost-effective AI systems is only going to intensify.
Furthermore, U.S. companies, while still at the forefront in some areas, may need to adapt quickly to stay competitive, particularly as smaller start-ups harness the power of cost-efficient, open-source AI.
Now, the real question moving forward is whether the rise of DeepSeek will mark the beginning of a new era in AI—one where the lines of power and innovation are not solely drawn by American companies, but by a global community of tech pioneers.
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
Cookie
Duration
Description
cookielawinfo-checkbox-advertisement
1 year
Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Advertisement" category.
cookielawinfo-checkbox-analytics
1 year
Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Analytics" category.
cookielawinfo-checkbox-functional
1 year
The GDPR Cookie Consent plugin sets the cookie to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary
1 year
Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Necessary" category.
cookielawinfo-checkbox-others
1 year
Set by the GDPR Cookie Consent plugin, this cookie stores user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance
1 year
Set by the GDPR Cookie Consent plugin, this cookie stores the user consent for cookies in the category "Performance".
CookieLawInfoConsent
1 year
CookieYes sets this cookie to record the default button state of the corresponding category and the status of CCPA. It works only in coordination with the primary cookie.
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Cookie
Duration
Description
__cf_bm
30 minutes
Cloudflare set the cookie to support Cloudflare Bot Management.
mailchimp_landing_site
1 month
MailChimp sets the cookie to record which page the user first visited.
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Cookie
Duration
Description
_fbp
3 months
Facebook sets this cookie to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising after visiting the website.
_ga
1 year 1 month 4 days
Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*
1 year 1 month 4 days
Google Analytics sets this cookie to store and count page views.
_gat_gtag_UA_*
1 minute
Google Analytics sets this cookie to store a unique user ID.
_gid
1 day
Google Analytics sets this cookie to store information on how visitors use a website while also creating an analytics report of the website's performance. Some of the collected data includes the number of visitors, their source, and the pages they visit anonymously.
CONSENT
2 years
YouTube sets this cookie via embedded YouTube videos and registers anonymous statistical data.
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Cookie
Duration
Description
test_cookie
15 minutes
doubleclick.net sets this cookie to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE
5 months 27 days
YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
YSC
session
Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices
never
YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id
never
YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId
never
YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests
never
YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.