lang icon En
Jan. 9, 2025, 12:19 p.m.
3187

Elon Musk and AI Experts Discuss Synthetic Data for AI Training

Brief news summary

Elon Musk and AI experts highlight a growing scarcity of real-world data for training AI models. Musk suggests that most usable human knowledge for AI was exhausted by last year. This aligns with Ilya Sutskever's "peak data" theory and encourages a shift toward synthetic data, created by AI models themselves. Musk believes AI must generate and evaluate its own data for advancement. Tech giants like Microsoft, Meta, OpenAI, and Anthropic are already adopting synthetic data for AI training. Gartner predicts that by 2024, 60% of AI and analytics data will be synthetic. Examples of AI models using synthetic data include Microsoft's Phi-4, Google's Gemma, Anthropic's Claude 3.5 Sonnet, and Meta's Llama series. Synthetic data offers cost benefits. For instance, Writer's Palmyra X 004 model, primarily built with synthetic data, cost $700,000, much less than the $4.6 million spent on a similar model by OpenAI.

Elon Musk agrees with other AI experts that there is a scarcity of real-world data left for training AI models. During a livestreamed discussion with Stagwell chairman Mark Penn on X, Musk remarked, “We’ve essentially used up the cumulative sum of human knowledge in AI training, " noting that this depletion occurred last year. Musk, who runs the AI company xAI, echoed sentiments from former OpenAI chief scientist Ilya Sutskever, who addressed this issue at the NeurIPS machine learning conference in December. Sutskever mentioned that the AI industry had reached “peak data, ” predicting that the shortage of training data would necessitate a change in current model development practices. Musk suggested that synthetic data, generated by AI itself, is the way forward, saying, “The only way to supplement [real-world data] is with synthetic data, where the AI creates [training data]. With synthetic data, [AI] will grade itself and engage in self-learning. ” Many companies, including Microsoft, Meta, OpenAI, and Anthropic, are already incorporating synthetic data to train their primary AI models.

According to Gartner, 60% of data used for AI and analytics projects in 2024 will be synthetically generated. Microsoft’s recently open-sourced Phi-4 was trained using both synthetic and real-world data, as were Google’s Gemma models. Anthropic also utilized synthetic data for its highly capable Claude 3. 5 Sonnet system, and Meta fine-tuned its latest Llama series of models with AI-generated data. Training models on synthetic data also offers cost benefits. The AI startup Writer claims its Palmyra X 004 model, largely built on synthetic sources, cost just $700, 000 to develop, compared to the estimated $4. 6 million for a similar-sized model from OpenAI.


Watch video about

Elon Musk and AI Experts Discuss Synthetic Data for AI Training

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Dec. 30, 2025, 9:31 a.m.

I work in tech sales and use AI every day — but I…

This as-told-to essay is drawn from a conversation with Antoine Wade, a tech sales professional based in San Antonio.

Dec. 30, 2025, 9:24 a.m.

Meta Platforms Announces $10 Billion Investment i…

Meta Platforms Inc.

Dec. 30, 2025, 9:23 a.m.

HVLP Copper Foil Sees Demand Surge; China Acceler…

The global HVLP (Very Low Profile) copper foil market is experiencing significant growth this year, primarily driven by rising demand for AI servers.

Dec. 30, 2025, 9:14 a.m.

The AI processor market explosion

Jon Peddie, founder and president of Jon Peddie Research, was the featured guest on DE 24/7 tech podcaster Kenneth Wong’s show, where he discussed the rapidly expanding AI processor industry and the daily fluctuations within this billion-dollar market.

Dec. 30, 2025, 9:13 a.m.

AI and SEO: Understanding the Synergy Between Tec…

The evolving relationship between artificial intelligence (AI) and search engine optimization (SEO) is profoundly transforming the digital marketing landscape.

Dec. 30, 2025, 9:13 a.m.

AI in Video Production: Streamlining Post-Product…

The post-production phase of video production is undergoing a major transformation with the growing adoption of artificial intelligence (AI) technologies.

Dec. 30, 2025, 5:25 a.m.

Intel's Leadership Restructuring Amid AI Chip Mar…

Intel Corporation has initiated significant leadership changes and workforce reductions within its foundry operations as part of a broader corporate restructuring aimed at refocusing its business strategy to better address the rapidly evolving artificial intelligence (AI) market.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today