Nov. 25, 2024, 2:03 p.m.

NVIDIA Unveils Fugatto: A Revolutionary AI for Sound and Music Generation

Brief news summary

NVIDIA has unveiled Fugatto, the Foundational Generative Audio Transformer Opus 1, an experimental AI model for audio generation and manipulation. It lets users generate and edit audio, such as music and voices, through text prompts. Created by an international team of AI researchers, Fugatto handles multiple accents and languages and aims to understand and generate sound the way humans do, as explained by Rafael Valle of NVIDIA. Fugatto has diverse potential applications: music producers could swiftly create song prototypes, language-learning tools could use it to generate voice material, and game developers could use it to vary audio assets in response to player actions. It can also produce soundscapes that evolve over time. NVIDIA has not yet said whether Fugatto will be released publicly, but other AI audio tools are already available: Meta offers an open-source toolkit for converting text into sound, and Google's MusicLM provides text-to-music functionality through the AI Test Kitchen platform.

NVIDIA has introduced an experimental generative AI model called the Foundational Generative Audio Transformer Opus 1, or Fugatto. This model, described as a "Swiss Army knife for sound," uses text prompts to generate or modify audio, music, voice, and sound files. It was designed by a global team of researchers, which, according to NVIDIA, enhanced its "multi-accent and multilingual capabilities." Rafael Valle, a researcher and manager of applied audio research at NVIDIA, stated, "We wanted to create a model that understands and generates sound like humans do." The company suggests that Fugatto could help music producers swiftly generate song prototypes and then easily edit them to try different styles, voices, and instruments. Fugatto could also be used to generate voice materials for language learning tools, and video game developers might use it to create variations of audio assets based on player actions. Furthermore, researchers discovered that with some fine-tuning, Fugatto can perform tasks beyond its pre-training, such as combining separate instructions to generate specific speech or sound scenarios, for example speech with a particular accent and emotional tone, or birds singing in a thunderstorm.

Additionally, it can produce sounds that evolve over time, such as a shifting rainstorm. NVIDIA has not said whether Fugatto will be made publicly available. It is not, however, the first generative AI capable of creating sound from text prompts. Meta has released an open-source AI kit for sound generation, and Google offers a text-to-music AI, MusicLM, available through its AI Test Kitchen website.

