DeepSeek V3: Misidentification with ChatGPT Raises Concerns

Earlier this week, DeepSeek, a Chinese AI lab, launched DeepSeek V3, an AI model surpassing many others in efficiency for tasks like coding and writing. However, the model mistakenly believes it is ChatGPT. Tests revealed that DeepSeek V3 identifies as ChatGPT, claiming to be a version of OpenAI's GPT-4 model from 2023. The model even mimics GPT-4's responses, including telling similar jokes. The confusion arises because AI models like ChatGPT and DeepSeek V3 are statistical systems trained on vast datasets to predict patterns. DeepSeek has not disclosed its training data sources, but there's an abundance of public datasets with GPT-4-generated text. It seems DeepSeek V3 might have memorized some of these outputs. Training models on outputs from rival systems can be detrimental, causing inaccuracies and hallucinations.
It may also breach usage terms, as OpenAI prohibits using its product outputs to develop competing models. OpenAI's CEO, Sam Altman, subtly criticized this practice, highlighting the ease of copying versus innovating. This misidentification issue isn't unique to DeepSeek V3; other models like Google’s Gemini also misidentify. The growing prevalence of AI-generated content on the web, which serves as training data, exacerbates this problem. It's possible DeepSeek used ChatGPT-generated text for training, similar to past accusations against Google. Heidy Khlaaf from the AI Now Institute noted the appeal of using existing models' knowledge to save costs, despite the risks. This practice may lead to models like DeepSeek V3 unintentionally replicating biases and flaws from GPT-4, raising concerns about accuracy and self-identification.
Brief news summary
DeepSeek, a Chinese AI lab, recently launched DeepSeek V3, known for its coding and essay writing abilities. Yet, it mistakenly identifies itself as ChatGPT, often claiming to be OpenAI's GPT-4. According to tests by TechCrunch and social media posts, DeepSeek V3 mirrors GPT-4’s instructions and humor. The confusion may arise from its training data, possibly containing GPT-4 outputs, causing it to memorize and replicate them. Mike Cook from King’s College London warns that using competitor's outputs can degrade model quality and may violate terms of service, as OpenAI restricts using its outputs to develop competing models. OpenAI and DeepSeek have not commented on this issue, but OpenAI's CEO, Sam Altman, hinted that some competitors might copy rather than innovate. Similar situations have occurred with Google’s Gemini. As AI-generated content grows, distinguishing it from genuine data is harder, leading models like DeepSeek V3 to mistakenly incorporate GPT-4 content and potentially adopt its biases.
AI-powered Lead Generation in Social Media
and Search Engines
Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment
Learn how AI can help your business.
Let’s talk!

Blockchain ecosystem sets stage for 4B football f…
0xFútbol seeks to unite the global football community by integrating blockchain technology, enabling fans to actively participate, influence, and gain ownership within the sport.

Behind the Curtain: The Great Fusing
The ongoing convergence between the U.S. government and leading technology firms signals a transformative shift in artificial intelligence (AI) and space technology.

Why privacy in blockchain must start with open so…
Traditionally, trust was placed in centralized institutions like banks, payment networks, and clearinghouses—closed systems where users relied on external audits, government regulation, and long compliance histories to feel secure.

AI in Autonomous Vehicles: Navigating the Road Ah…
Artificial intelligence (AI) is central to the rapidly advancing autonomous vehicle industry, driving major changes in how vehicles function and interact with their environment.

Bergen County launches blockchain pilot to modern…
Bergen County has entered into a five-year partnership with blockchain startup Balcony to digitize and secure 370,000 property deeds, representing approximately $240 billion in real estate value.

AI in Healthcare: Enhancing Diagnostic Accuracy a…
Artificial intelligence (AI) is increasingly transforming healthcare by enhancing how medical professionals diagnose, treat, and manage various conditions.

This platform offers a blockchain solution to out…
Backed by major investors like Circle, Coinbase, and Solana Ventures, Zebec Network aims to build real-world financial infrastructure by bridging Web2 and Web3 with streaming payroll, crypto cards, and enterprise tools.