lang icon En
Dec. 27, 2024, 6:57 a.m.
10887

DeepSeek's V3 Model Surpasses Tech Giants with Budget Innovation

Brief news summary

Chinese start-up DeepSeek has made waves in the global AI scene with the release of its new large language model (LLM), DeepSeek V3. With 671 billion parameters, it was trained in just two months at a cost of US$5.58 million. Despite using fewer computing resources compared to giants like Meta and OpenAI, DeepSeek V3 outperformed its competitors in benchmark tests. This success showcases the progress of Chinese AI companies, even in the face of US sanctions limiting access to advanced semiconductors. DeepSeek V3 is essential for generative AI services due to its ability to manage complex data and deliver precise predictions. Andrej Karpathy from OpenAI lauded DeepSeek's efficient training approach, achieved by sharing only pretrained weights, allowing others to use the model without disclosing its training code or datasets. This innovative strategy marks an important step for Chinese AI firms in the fiercely competitive global market.

DeepSeek’s V3 model was developed over two months for US$5. 58 million, utilizing fewer computing resources than its competitors. Reading Time: 2 minutes Why you can trust SCMP Reported by Ben Jiang in Beijing Published and Updated: 6:45pm, 27 Dec 2024 DeepSeek, a Chinese start-up, has stirred the global AI industry with its new large language model (LLM), which has outperformed models from Meta Platforms and ChatGPT creator OpenAI in benchmark tests. The Hangzhou-based company announced via WeChat on Thursday that its LLM, DeepSeek V3, boasts 671 billion parameters and was trained over roughly two months at a cost of US$5. 58 million, using notably fewer computing resources than those developed by larger tech firms. An LLM supports generative AI services like ChatGPT, and having a high parameter count is crucial for adapting to complex data patterns and making precise predictions. Computer scientist Andrej Karpathy, a founding team member at OpenAI, commented on the Chinese start-up's report on its new AI model, stating on social media platform X, “DeepSeek making it look easy …

with an open weights release of a frontier-grade LLM trained on a joke of a budget. ” Open weights imply releasing only the pretrained parameters, or weights, of an AI model, allowing third parties to use the model for inference and fine-tuning but not providing the training code, original data set, architecture details, and training methodology. DeepSeek’s creation of a strong LLM on a budget far smaller than what bigger companies like Meta and OpenAI usually invest highlights the progress made by Chinese AI firms, despite US sanctions restricting their access to advanced semiconductors necessary for training models.


Watch video about

DeepSeek's V3 Model Surpasses Tech Giants with Budget Innovation

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Dec. 20, 2025, 1:24 p.m.

5 Cultural Attributes That Could Make or Break Yo…

Summary and Rewrite of “The Gist” on AI Transformation and Organizational Culture AI transformation poses primarily a cultural challenge rather than a purely technological one

Dec. 20, 2025, 1:22 p.m.

AI Sales Agent: Top 5 Future Sales Boosters of 20…

The ultimate aim of businesses is to expand sales, but stiff competition can impede this goal.

Dec. 20, 2025, 1:19 p.m.

AI and SEO: A Perfect Match for Enhanced Online V…

The incorporation of artificial intelligence (AI) into search engine optimization (SEO) strategies is fundamentally transforming how businesses improve their online visibility and attract organic traffic.

Dec. 20, 2025, 1:15 p.m.

Deepfake Technology Advances: Implications for Me…

Deepfake technology has made significant strides recently, producing highly realistic manipulated videos that convincingly portray individuals doing or saying things they never actually did.

Dec. 20, 2025, 1:13 p.m.

Nvidia's Open Source AI Push: Acquisition and New…

Nvidia has announced a significant expansion of its open source initiatives, signaling a strategic commitment to supporting and advancing the open source ecosystem in high-performance computing (HPC) and artificial intelligence (AI).

Dec. 20, 2025, 9:38 a.m.

N.Y. Gov. Kathy Hochul signs sweeping AI safety b…

On December 19, 2025, New York Governor Kathy Hochul signed the Responsible Artificial Intelligence Safety and Ethics (RAISE) Act into law, marking a significant milestone in the state’s regulation of advanced AI technologies.

Dec. 20, 2025, 9:36 a.m.

Stripe launches Agentic Commerce Suite for AI sal…

Stripe, the programmable financial services firm, has introduced the Agentic Commerce Suite, a new solution aimed at enabling businesses to sell through multiple AI agents.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today