June 18, 2024, 5:41 p.m.

Brief news summary

Meta's Fundamental AI Research (FAIR) team has released five new artificial intelligence (AI) research models. They include models that can generate text and images, detect AI-generated speech within audio snippets, and offer improved control over AI music generation. Meta says it released the models to encourage the responsible advancement of AI. One of them, Chameleon, can understand and generate both images and text, which allows it to caption images or create new scenes from text and images. Other releases include pretrained code completion models and an audio watermarking technique for detecting AI-generated speech within audio. Meta has also released evaluation code and annotations intended to improve the diversity of text-to-image generation systems. The company plans to invest between $35 billion and $40 billion in AI and metaverse development by the end of 2024.

Meta's Fundamental AI Research (FAIR) team announced the release of five new artificial intelligence (AI) research models. Their capabilities include text and image generation, AI-generated speech detection, and code completion. One of the models, Chameleon, can understand and generate both images and text. It accepts input that mixes text and images and produces output that combines the two. Meta said this capability could be used to generate captions for images or to create new scenes from text prompts and images.

Pretrained models for code completion were also released. These models were trained using Meta's multitoken prediction approach, in which large language models (LLMs) are trained to predict multiple future words simultaneously rather than one word at a time.

Another model, JASCO, provides more control over AI music generation. Instead of relying solely on text inputs, it can accept various inputs such as chords or beats, allowing symbols and audio to be incorporated into a single text-to-music generation model.

A model called AudioSeal introduces an audio watermarking technique that enables localized detection of AI-generated speech. It can pinpoint AI-generated segments within larger audio snippets and detect AI-generated speech up to 485 times faster than previous methods.

The fifth release aims to improve geographical and cultural diversity in text-to-image generation systems. Meta has provided geographic disparities evaluation code and annotations to improve evaluations of text-to-image models.

Meta said in an earnings report that it plans to invest $35 billion to $40 billion in AI and metaverse development by the end of 2024. Meta CEO Mark Zuckerberg also highlighted the company's various AI services, including AI assistants, augmented reality apps, and business AIs. To stay updated on AI news, you can subscribe to the daily AI Newsletter from PYMNTS.
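To make the multitoken prediction idea mentioned above more concrete, here is a minimal sketch of how training targets might differ between the usual one-word-at-a-time setup and a setup that predicts several future words at once. It is an illustration only, not Meta's implementation: the function names, the `n_future` parameter, and the word-level tokens are all assumptions made for this example.

```python
from typing import List, Tuple


def next_token_targets(tokens: List[str]) -> List[Tuple[List[str], str]]:
    """Conventional setup: each prefix is paired with the single next token."""
    return [(tokens[: i + 1], tokens[i + 1]) for i in range(len(tokens) - 1)]


def multi_token_targets(
    tokens: List[str], n_future: int
) -> List[Tuple[List[str], List[str]]]:
    """Multi-token setup (hypothetical helper): each prefix is paired with
    the next `n_future` tokens, which would be predicted together."""
    if n_future < 1:
        raise ValueError("n_future must be at least 1")
    pairs = []
    # Stop early enough that every prefix has a full window of future tokens.
    for i in range(len(tokens) - n_future):
        prefix = tokens[: i + 1]
        future = tokens[i + 1 : i + 1 + n_future]
        pairs.append((prefix, future))
    return pairs


if __name__ == "__main__":
    words = ["the", "cat", "sat", "on", "mat"]
    for prefix, future in multi_token_targets(words, n_future=2):
        print(prefix, "->", future)
```

With `n_future=1` the second function yields the same prefix and target pairs as the conventional setup, except that each target is wrapped in a one-element list. That makes one-word-at-a-time prediction a special case of the multi-token formulation in this sketch.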

