Wednesday, May 21, 2025
No Result
View All Result
newshub
  • Global news
  • Financial insights
    • Africa
    • Asia
    • Australia
    • Central Banks
    • China
    • Commodities
    • Europe
    • Banking
    • Corporate
    • Neobanking
    • Investment
    • Japan
    • South East Asia
    • Stock of the week
    • UK
    • US
  • Fin & tech
    • AI
    • Blockchain
    • Crypto
    • MSTRpay
    • Tech
  • Climate & energy
    • Climate
    • Carbon
    • Coal
    • Disruptive
    • Gas
    • Nuclear
    • Oil
    • Solar
    • Water
    • Waves
    • Wind
    • Renewable
    • South America
  • Lifestyle
    • Best chefs
    • Cocktail of the week
    • History
    • Influential women
  • WEX
    • Alt Kap Holding AB
    • Digital Network Holding, Inc.
    • Fantas-E AB
    • International Clean Energy Inc.
    • Intritum Partner Limited
    • Intritum Recycling GH Limited
    • MSTRpay AB
    • SWAP Services, Inc.
    • VMT Holding, Inc.
    • Universal Streaming Technologies – USTA
    • TC Unterhaltungselektronik AG
  • Global news
  • Financial insights
    • Africa
    • Asia
    • Australia
    • Central Banks
    • China
    • Commodities
    • Europe
    • Banking
    • Corporate
    • Neobanking
    • Investment
    • Japan
    • South East Asia
    • Stock of the week
    • UK
    • US
  • Fin & tech
    • AI
    • Blockchain
    • Crypto
    • MSTRpay
    • Tech
  • Climate & energy
    • Climate
    • Carbon
    • Coal
    • Disruptive
    • Gas
    • Nuclear
    • Oil
    • Solar
    • Water
    • Waves
    • Wind
    • Renewable
    • South America
  • Lifestyle
    • Best chefs
    • Cocktail of the week
    • History
    • Influential women
  • WEX
    • Alt Kap Holding AB
    • Digital Network Holding, Inc.
    • Fantas-E AB
    • International Clean Energy Inc.
    • Intritum Partner Limited
    • Intritum Recycling GH Limited
    • MSTRpay AB
    • SWAP Services, Inc.
    • VMT Holding, Inc.
    • Universal Streaming Technologies – USTA
    • TC Unterhaltungselektronik AG
No Result
View All Result
newshub
No Result
View All Result
ADVERTISEMENT

Stability AI unveils ‘Stable Audio’ model for controllable audio generation

2023/09/15/13:00
in AI
Reading Time: 3 mins read
247 5
A A
Stability AI unveils ‘Stable Audio’ model for controllable audio generation
MSTRpay MSTRpay MSTRpay
ADVERTISEMENT

Stability AI has introduced “Stable Audio,” a latent diffusion model designed to revolutionise audio generation.

This breakthrough promises to be another leap forward for generative AI and combines text metadata, audio duration, and start time conditioning to offer unprecedented control over the content and length of generated audio—even enabling the creation of complete songs.

Audio diffusion models traditionally faced a significant limitation in generating audio of fixed durations, often leading to abrupt and incomplete musical phrases. This was primarily due to the models being trained on random audio chunks cropped from longer files and then forced into predetermined lengths.

Stable Audio effectively tackles this historic challenge, enabling the generation of audio with specified lengths, up to the training window size.

One of the standout features of Stable Audio is its use of a heavily downsampled latent representation of audio, resulting in vastly accelerated inference times compared to raw audio. Through cutting-edge diffusion sampling techniques, the flagship Stable Audio model can generate 95 seconds of stereo audio at a 44.1 kHz sample rate in under a second utilising the power of an NVIDIA A100 GPU.

A sound foundation

The core architecture of Stable Audio comprises a variational autoencoder (VAE), a text encoder, and a U-Net-based conditioned diffusion model.

The VAE plays a pivotal role by compressing stereo audio into a noise-resistant, lossy latent encoding that significantly expedites both generation and training processes. This approach, based on the Descript Audio Codec encoder and decoder architectures, facilitates encoding and decoding of arbitrary-length audio while ensuring high-fidelity output.

To harness the influence of text prompts, Stability AI utilises a text encoder derived from a CLAP model specially trained on their dataset. This enables the model to imbue text features with information about the relationships between words and sounds. These text features, extracted from the penultimate layer of the CLAP text encoder, are integrated into the diffusion U-Net through cross-attention layers.

During training, the model learns to incorporate two key properties from audio chunks: the starting second (“seconds_start”) and the total duration of the original audio file (“seconds_total”). These properties are transformed into discrete learned embeddings per second, which are then concatenated with the text prompt tokens. This unique conditioning allows users to specify the desired length of the generated audio during inference.

The diffusion model at the heart of Stable Audio boasts a staggering 907 million parameters and leverages a sophisticated blend of residual layers, self-attention layers, and cross-attention layers to denoise the input while considering text and timing embeddings. To enhance memory efficiency and scalability for longer sequence lengths, the model incorporates memory-efficient implementations of attention.

ADVERTISEMENT

To train the flagship Stable Audio model, Stability AI curated an extensive dataset comprising over 800,000 audio files encompassing music, sound effects, and single-instrument stems. This rich dataset, furnished in partnership with AudioSparx – a prominent stock music provider – amounts to a staggering 19,500 hours of audio.

Stable Audio represents the vanguard of audio generation research, emerging from Stability AI’s generative audio research lab, Harmonai. The team remains dedicated to advancing model architectures, refining datasets, and enhancing training procedures. Their pursuit encompasses elevating output quality, fine-tuning controllability, optimising inference speed, and expanding the range of achievable output lengths.

Stability AI has hinted at forthcoming releases from Harmonai, teasing the possibility of open-source models based on Stable Audio and accessible training code.

WE/X WE/X WE/X
ADVERTISEMENT

This latest groundbreaking announcement follows a string of noteworthy stories about Stability. Earlier this week, Stability joined seven other prominent AI companies that signed the White House’s voluntary AI safety pledge as part of its second round.

You can try Stable Audio for yourself here.

Related Posts

Google unveils AI Mode: a conversational leap in search powered by Gemini 2.5
AI

Google unveils AI Mode: a conversational leap in search powered by Gemini 2.5

by newshub
11 hours ago

Google has officially introduced a major shift in how users interact with its search engine, unveiling a new feature called...

Read moreDetails
US tech firms strike AI deals as Trump tours Gulf states

US tech firms strike AI deals as Trump tours Gulf states

1 week ago
NVIDIA Dynamo: Scaling AI inference with open-source efficiency

NVIDIA Dynamo: Scaling AI inference with open-source efficiency

2 months ago
OpenAI and Musk agree to fast tracked trial over for-profit shift

OpenAI and Musk agree to fast tracked trial over for-profit shift

2 months ago
Oracle launches GenAI-based agents to fight financial crime

Oracle launches GenAI-based agents to fight financial crime

2 months ago
The role of Artificial Intelligence in personal finance: A game-changer for consumers

The role of Artificial Intelligence in personal finance: A game-changer for consumers

2 months ago
No Result
View All Result

Recent Posts

  • SEC charges Unicoin crypto platform over alleged $100 million fraud
  • Sailing across the Baltic: an idyllic voyage from Germany to Denmark
  • Why the universal banker model is still a work in progress
  • Trump’s approval rating slides amid legal woes and campaign controversies
  • Japan’s agriculture minister resigns after rice price gaffe sparks public outrage

Recent Comments

    Archives

    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • August 2023
    • July 2023
    • June 2023
    • May 2023
    • April 2023
    • March 2023
    • February 2023
    • January 2023
    • December 2022
    • November 2022
    • October 2022
    • September 2022
    • August 2022

    Categories

    • Africa
    • AI
    • An diesem Tag
    • Asia
    • Australia
    • Banking
    • Best chefs
    • Biden
    • Blockchain
    • Blockchain technology
    • Carbon
    • Central Banks
    • China
    • Climate
    • Climate & Energy
    • Coal
    • Cocktail of the week
    • Commodities
    • Corporate
    • Crypto
    • Deutsch
    • Deutsch PR
    • English PR
    • Europe
    • Financial insights
    • Focus on neobanking
    • Gas
    • Global news
    • Harris
    • History
    • India
    • Influential women
    • Invest and Rest
    • Italiano PR
    • Japan
    • Lifestyle
    • Metaverse
    • MSTRpay
    • Neobanking
    • News
    • newshub special
    • newshub-special
    • NFT
    • Nobel Prizes 2024
    • Nuclear
    • Oil
    • Press
    • Press releases
    • Pressroom
    • Renewable
    • Russia
    • Solar
    • South America
    • South East Asia
    • Stock of the week
    • Stocks
    • Svensk PR
    • Tech
    • Trump
    • Trump trials
    • UFO
    • UK
    • UK News
    • Ukraine
    • US
    • US politics
    • Waves
    • WEX
    • Wind

    Meta

    • Log in
    • Entries feed
    • Comments feed
    • WordPress.org

    Recent Posts

    • SEC charges Unicoin crypto platform over alleged $100 million fraud
    • Sailing across the Baltic: an idyllic voyage from Germany to Denmark
    • Why the universal banker model is still a work in progress
    • Trump’s approval rating slides amid legal woes and campaign controversies
    • Japan’s agriculture minister resigns after rice price gaffe sparks public outrage

    Categories

    • Africa
    • AI
    • An diesem Tag
    • Asia
    • Australia
    • Banking
    • Best chefs
    • Biden
    • Blockchain
    • Blockchain technology
    • Carbon
    • Central Banks
    • China
    • Climate
    • Climate & Energy
    • Coal
    • Cocktail of the week
    • Commodities
    • Corporate
    • Crypto
    • Deutsch
    • Deutsch PR
    • English PR
    • Europe
    • Financial insights
    • Focus on neobanking
    • Gas
    • Global news
    • Harris
    • History
    • India
    • Influential women
    • Invest and Rest
    • Italiano PR
    • Japan
    • Lifestyle
    • Metaverse
    • MSTRpay
    • Neobanking
    • News
    • newshub special
    • newshub-special
    • NFT
    • Nobel Prizes 2024
    • Nuclear
    • Oil
    • Press
    • Press releases
    • Pressroom
    • Renewable
    • Russia
    • Solar
    • South America
    • South East Asia
    • Stock of the week
    • Stocks
    • Svensk PR
    • Tech
    • Trump
    • Trump trials
    • UFO
    • UK
    • UK News
    • Ukraine
    • US
    • US politics
    • Waves
    • WEX
    • Wind

    Archives

    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • August 2023
    • July 2023
    • June 2023
    • May 2023
    • April 2023
    • March 2023
    • February 2023
    • January 2023
    • December 2022
    • November 2022
    • October 2022
    • September 2022
    • August 2022
    WE/X WE/X WE/X
    newshub

    © 2023-2025
    A part of MSTRpay
    MSTRpay
    Legal & Disclosure

    • Global news
    • Financial insights
    • Fin & tech
    • Climate & energy
    • Lifestyle
    • WEX

    Welcome Back!

    Login to your account below

    Forgotten Password?

    Retrieve your password

    Please enter your username or email address to reset your password.

    Log In
    Please enter CoinGecko Free Api Key to get this plugin works.

    Add New Playlist

    No Result
    View All Result
    • Global news
    • Financial insights
      • Africa
      • Asia
      • Australia
      • Central Banks
      • China
      • Commodities
      • Europe
      • Banking
      • Corporate
      • Neobanking
      • Investment
      • Japan
      • South East Asia
      • Stock of the week
      • UK
      • US
    • Fin & tech
      • AI
      • Blockchain
      • Crypto
      • MSTRpay
      • Tech
    • Climate & energy
      • Climate
      • Carbon
      • Coal
      • Disruptive
      • Gas
      • Nuclear
      • Oil
      • Solar
      • Water
      • Waves
      • Wind
      • Renewable
      • South America
    • Lifestyle
      • Best chefs
      • Cocktail of the week
      • History
      • Influential women
    • WEX
      • Alt Kap Holding AB
      • Digital Network Holding, Inc.
      • Fantas-E AB
      • International Clean Energy Inc.
      • Intritum Partner Limited
      • Intritum Recycling GH Limited
      • MSTRpay AB
      • SWAP Services, Inc.
      • VMT Holding, Inc.
      • Universal Streaming Technologies – USTA
      • TC Unterhaltungselektronik AG

    © 2023-2025
    A part of MSTRpay
    MSTRpay
    Legal & Disclosure