lang icon English
July 30, 2023, 9:22 a.m.
587

None

The choice of language for a Large Language Model (LLM) has a significant impact on its cost and can create a divide between English speakers and the rest of the world. A recent study reveals that English-language inputs and outputs are much cheaper than those in other languages. For example, Simplified Chinese costs about twice as much, Spanish costs 1. 5 times the price, and Shan language costs 15 times more. Research conducted by the University of Oxford found that processing a Burmese-written sentence with an LLM costs 198 tokens, while the same sentence in English only costs 17 tokens. This means that accessing the service through an API incurs 11 times more cost for the Burmese sentence compared to the English sentence. The tokenization model used by AI companies converts user input into computational cost, making models accessed outside the window of English language more expensive to access and train on. Languages like Chinese, with a different and more complex structure, have a higher rate of tokenization. For instance, a tokenization example by OpenAI's GPT3 tokenizer reveals that the phrase "your affection" would be only two tokens in English but eight tokens in Simplified Chinese. Despite the English phrase being longer (14 characters) compared to the Simplified Chinese one (4 characters), the higher token-to-char ratio makes it more expensive to implement the API for languages other than English.

The cost-effectiveness of English in AI-related expenses is unparalleled, with Chinese costing twice as much as English in terms of required tokens per output. This cost difference stems from the training data available to AI companies. Furthermore, achieving recursive training, or training AI models on their own outputs, is a desire for AI companies. However, research suggests that AI networks become unstable when trained multiple times on their own synthetic data. Different ways of quantifying costs, such as bit or character-counting, still face similar problems as tokenization and cannot surpass the practicality and lower costs of English. This issue is not limited to one model or version but affects multiple language models. The fact that the companies introducing Large Language Models are mostly based in America contributes to the cost difference, as lower usage costs and higher availability of quality data are inherent to the territory. The cost disparity has prompted several countries, including China and India, to develop their own initiatives to train and deploy native-language LLMs to keep up with the pace of innovation brought by English-based AI networks. It is crucial to proceed with caution in the complex realm of AI, considering the far-reaching consequences of each step taken.



Brief news summary

None

Watch video about

None

Try our premium solution and start getting clients — at no cost to you

I'm your Content Creator.
Let’s make a post or video and publish it on any social media — ready?

Language

Hot news

Oct. 24, 2025, 2:36 p.m.

C3.ai Restructures Sales Team Amid 33% Revenue Ou…

C3.ai, a leading enterprise artificial intelligence software provider, has announced a major restructuring of its global sales and services organization to boost operational efficiency and better align resources with long-term growth goals.

Oct. 24, 2025, 2:26 p.m.

Mondelez Implements Generative AI Tool to Reduce …

Snack manufacturer Mondelez International is utilizing a newly developed generative artificial intelligence (AI) tool to drastically cut costs in marketing content creation, achieving a 30% to 50% reduction in production expenses, according to a senior company executive.

Oct. 24, 2025, 2:19 p.m.

South Korea Reportedly to Build the World's Large…

South Korea is poised to make a major advancement in artificial intelligence by planning to build the world’s largest AI data center, with a power capacity of 3,000 megawatts—about three times larger than the existing "Star Gate" data center.

Oct. 24, 2025, 2:18 p.m.

OpenAI's ChatGPT Hits 700 Million Active Weekly U…

In August 2025, OpenAI announced a major milestone: ChatGPT, its advanced conversational AI platform, had reached an impressive 700 million active weekly users.

Oct. 24, 2025, 2:16 p.m.

Krafton Declares 'AI First' Strategy, Plans $70 M…

Krafton, the well-known publisher behind popular games like PUBG and Hi-Fi Rush, is undertaking a bold strategic transformation by integrating artificial intelligence (AI) into almost every aspect of its operations.

Oct. 24, 2025, 2:10 p.m.

Ethical Considerations in AI-Generated Video Cont…

The rise of AI-generated video content has sparked significant discussion in the digital media industry, bringing urgent ethical concerns to the forefront.

Oct. 24, 2025, 10:29 a.m.

AI and SEO: Enhancing User Experience and Engagem…

Artificial intelligence (AI) is becoming an essential tool for improving user experience and engagement through advanced search engine optimization (SEO) techniques.

All news

AI Company

Launch your AI-powered team to automate Marketing, Sales & Growth

and get clients on autopilot — from social media and search engines. No ads needed

Begin getting your first leads today