Stability AI Releases Stable Audio 3.0 With 6-Minute Music Generation

Stable Audio 3.0 is the first open-weight music generation model that is trained entirely on licensed data and permits unrestricted commercial use, addressing a central legal and ethical dispute in the AI music space.

Reporting from 1 source: GameBusiness.jp.

Stability AI released Stable Audio 3.0, a new series of music generation AI models trained on licensed data from Universal Music Group and Warner Music Group. The models are open-weight and allow commercial use. The lineup includes models capable of generating songs over 6 minutes, with the Small variant supporting full on-device composition for the first time.

Stability AI has released Stable Audio 3.0, a new generation of music and audio generation AI models. The company trained the models on fully licensed data through partnerships with Universal Music Group and Warner Music Group, and released them as open-weight models. Users own the outputs and can use them commercially without restriction. The lineup includes three open-weight variants: Small SFX for sound effects, Small for songs up to 2 minutes, and Medium for songs over 6 minutes. A Large model is available via API. Stability AI says the Small model is the first capable of full song composition on device and offline. The release also brings LoRA fine-tuning to audio generation models for the first time, along with audio inpainting for editing specific segments of a track or extending it. For organizations with annual revenue over $1 million, an enterprise license provides legal indemnification. A new product suite for musicians is under development.

  • 3.0 Small SFX: Open-weight model specialized for sound effect generation
  • 3.0 Small: Open-weight model capable of generating songs up to 2 minutes; first model to support full song composition on device
  • 3.0 Medium: Open-weight model capable of generating songs over 6 minutes
  • 3.0 Large: Model available via API, capable of generating songs over 6 minutes

Synthesized by Yomimono from the 1 cited source below, including Japanese-language reporting where cited, then editorially reviewed before publishing.

Sources