NVIDIA Unveils Fugatto: A Revolutionary AI for Sound and Music Generation

NVIDIA has introduced an experimental generative AI model called the Foundational Generative Audio Transformer Opus 1, or Fugatto. The model, described as a "Swiss Army knife for sound," uses text prompts to generate or modify audio, including music, voices, and other sounds. According to NVIDIA, it was designed by a global team of researchers, which enhanced its "multi-accent and multilingual capabilities." Rafael Valle, a researcher and manager of applied audio research at NVIDIA, stated, "We wanted to create a model that understands and generates sound like humans do."
The company suggests that Fugatto could help music producers quickly generate song prototypes and then easily edit them to try different styles, voices, and instruments. It could also be used to generate voice materials for language learning tools, and video game developers might use it to create variations of audio assets based on player actions. Researchers also found that, with some fine-tuning, Fugatto can perform tasks beyond its pre-training, such as combining separate instructions to produce a specific result: speech with a particular accent and emotional tone, for example, or birds singing in a thunderstorm. It can also produce sounds that evolve over time, such as a shifting rainstorm.
NVIDIA has not confirmed whether Fugatto will be made publicly available. It is also not the first generative AI capable of creating sound from text prompts. Meta has released an open-source AI kit for sound generation, and Google offers a text-to-music AI, MusicLM, through its AI Test Kitchen website.
Brief news summary
NVIDIA has unveiled Fugatto, the Foundational Generative Audio Transformer Opus 1, an experimental AI model for generating and modifying audio. It lets users create and edit audio, such as music and voices, through text prompts. Created by an international team of researchers, Fugatto handles multiple accents and languages and aims to understand and generate sound the way humans do, as explained by Rafael Valle of NVIDIA. Its potential applications are varied: music producers could quickly create song prototypes, language learning tools could use it to generate voice materials, and game developers could adapt sounds to player actions. It can also produce dynamic soundscapes that evolve over time. NVIDIA has not yet said whether Fugatto will be released publicly, but other text-to-audio tools already exist. Meta offers an open-source toolkit for generating sound from text, and Google's MusicLM provides text-to-music functionality through the AI Test Kitchen platform.