lang icon En
Jan. 23, 2025, 11:01 p.m.
4313

DeepSeek-R1: A Cost-Effective Chinese AI Model Rivals OpenAI's o1

Brief news summary

The DeepSeek-R1 language model, developed by a Chinese startup, is attracting significant interest due to its cost-efficiency relative to OpenAI's models. Initial analyses reveal R1's impressive capabilities in areas such as chemistry, mathematics, and programming, with researcher Elvis Saravia noting its "wild and totally unexpected" abilities. Notably, DeepSeek employs an "open-weight" system, encouraging collaboration and algorithm enhancement among researchers, in contrast to OpenAI's proprietary methods. Furthermore, R1 operates at about one-thirtieth the cost of OpenAI's offerings, enabling more affordable testing options. This development reflects rapid advancements in China's language model sector, even amidst U.S. restrictions on AI tech exports. Experts indicate that R1's efficiency represents a trend towards resource optimization rather than solely enhancing computational power, suggesting a shrinking technological gap between the U.S. and China. This progress may foster opportunities for collaboration in the AI field, moving beyond conventional competitive dynamics.

A large language model developed in China, known as DeepSeek-R1, is exciting scientists as a cost-effective and open alternative to reasoning models like OpenAI's o1. These models produce answers in a sequence that mimics human reasoning, making them more skilled than older language models in tackling scientific issues, potentially proving beneficial for research. Early assessments of R1, which launched on January 20, indicate that its performance in tasks related to chemistry, mathematics, and coding rivals that of o1— which impressed researchers upon its release by OpenAI in September. “This is astonishing and completely unexpected, ” commented Elvis Saravia, an AI researcher and co-founder of the UK-based AI consulting firm DAIR. AI, on X. R1 is notable for another reason. The start-up DeepSeek, based in Hangzhou, has made the model ‘open-weight’, allowing researchers to analyze and enhance the algorithm. It’s published under an MIT license, enabling free reuse, though it’s not fully open source, as the training data remains undisclosed. “The transparency shown by DeepSeek is remarkable, ” states Mario Krenn, head of the Artificial Scientist Lab at the Max Planck Institute for the Science of Light in Erlangen, Germany. In contrast, o1 and other models from OpenAI in San Francisco, including the recent o3, are considered “essentially black boxes, ” he notes. While AI hallucinations cannot be entirely eliminated, certain techniques can help mitigate their impact. DeepSeek hasn't disclosed the total cost of training R1, but it charges users about one-thirtieth of o1’s operational expenses. The company has also developed mini ‘distilled’ versions of R1, allowing researchers with limited computing resources to experiment with the model. Krenn notes, “An experiment that cost over £300 with o1 was less than $10 with R1, ” highlighting a significant difference that could influence future adoption. R1 is part of a surge in Chinese large language models (LLMs).

Emerging from a hedge fund, DeepSeek gained recognition last month with the release of a chatbot named V3, which surpassed major competitors despite a limited budget. Experts estimate that the hardware rental for training the model was around $6 million, compared to over $60 million for Meta's Llama 3. 1 405B, which utilized 11 times the computing resources. The excitement around DeepSeek stems in part from its achievement in developing R1 amidst U. S. export controls restricting Chinese companies' access to top-tier AI processing chips. “The fact that it originates from China demonstrates that efficient resource management is more critical than sheer computing power, ” asserts François Chollet, an AI researcher based in Seattle, Washington. DeepSeek’s advancements indicate that “the gap previously perceived as a U. S. lead has significantly narrowed, ” observed Alvin Wang Graylin, a technology expert in Bellevue, Washington, who is affiliated with the Taiwan-based immersive technology company HTC, on X. “Both countries should consider a cooperative strategy for developing advanced AI rather than continuing the current futile arms race. ”


Watch video about

DeepSeek-R1: A Cost-Effective Chinese AI Model Rivals OpenAI's o1

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Content Maker

Our unique Content Maker allows you to create an SEO article, social media posts, and a video based on the information presented in the article

news image

Last news

The Best for your Business

Hot news

Jan. 28, 2026, 1:25 p.m.

Runway Partners with IMAX to Showcase AI-Generate…

Runway Partners with IMAX to Showcase AI-Generated Films Across Major U

Jan. 28, 2026, 1:22 p.m.

Nvidia's AI Chip Sales Surge Amid Data Center Dem…

Nvidia, a leading technology firm famed for its graphics processing units (GPUs), has announced a substantial rise in sales of its AI chips.

Jan. 28, 2026, 1:19 p.m.

UK Proposes Forcing Google to Let Publishers Opt …

Britain’s competition watchdog, the Competition and Markets Authority (CMA), has proposed that Google should allow news sites and content creators to opt out of having their online content scraped for use in its AI-generated news summaries.

Jan. 28, 2026, 1:15 p.m.

Walsin Lihwa Captures 50% Share in China's AI Ser…

Walsin Lihwa, a leading stainless steel producer, has made a major breakthrough by entering China’s artificial intelligence (AI) server market.

Jan. 28, 2026, 1:13 p.m.

AI Video Synthesis Enhances Virtual Reality Exper…

Virtual reality (VR) developers are increasingly integrating artificial intelligence (AI) video synthesis technologies to craft highly realistic video content that reacts and adapts to user interactions in real time.

Jan. 28, 2026, 1:07 p.m.

Tesla will report Q4 earnings results after the 4…

Tesla is set to release its Q4 earnings results following the market close at 4 p.m.

Jan. 28, 2026, 9:31 a.m.

Project Prometheus: Jeff Bezos's New AI Venture

Project Prometheus is an innovative artificial intelligence (AI) startup founded by Jeff Bezos in November 2025.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today