NVIDIA has introduced an experimental generative AI model called the Foundational Generative Audio Transformer Opus 1, or Fugatto. Described as a "Swiss Army knife for sound," the model uses text prompts to generate or modify audio, music, voice, and sound files. Designed by a global team of researchers, Fugatto has enhanced "multi-accent and multilingual capabilities," according to NVIDIA. Rafael Valle, a researcher and manager of applied audio research at NVIDIA, stated, "We wanted to create a model that understands and generates sound like humans do."
The company suggests that Fugatto could help music producers quickly generate song prototypes and then easily edit them for different styles, voices, and instruments. It could also be used to generate voice materials for language learning tools, and video game developers might use it to create variations of assets based on player actions.
Researchers also found that, with some fine-tuning, Fugatto can perform tasks beyond its pre-training, such as combining separate instructions to generate specific speech or sound scenarios, for example speech with a particular accent and emotional tone, or birds singing in a thunderstorm. It can also produce sounds that evolve over time, such as a shifting rainstorm.
NVIDIA has not said whether Fugatto will be made publicly available. It is not the first generative AI capable of creating sound from text prompts: Meta has released an open-source AI kit for sound generation, and Google offers a text-to-music AI, MusicLM, through its AI Test Kitchen website.
NVIDIA Unveils Fugatto: A Revolutionary AI for Sound and Music Generation