Stable Audio: A New Latent Diffusion Model for Controllable Audio Generation

October 6, 2023 Team ME ScholarHangout

Stability AI has unveiled Stable Audio, a latent diffusion model for controllable audio generation. The model is conditioned on text metadata together with audio duration and start time, which gives users control over both the content and the length of the generated audio and makes it possible to generate complete songs.

Stable Audio addresses a significant limitation of previous audio diffusion models, which could not generate audio of a specified duration. Those models were trained on random chunks of audio that were cropped or padded to a predetermined length, so their output was tied to that length. Stable Audio overcomes this challenge by working in a heavily downsampled latent representation of audio, which greatly speeds up inference, and by conditioning on duration and start time, which lets the model generate audio of a requested length.
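The duration and start time conditioning described above can be illustrated with a minimal sketch. It is not Stability AI's implementation: the names `TimingConditioning`, `sample_chunk`, `inference_conditioning`, `seconds_start`, and `seconds_total`, as well as the 30-second window in the usage note, are assumptions chosen for illustration. The idea is that each random training chunk is tagged with where it starts and how long the full track is, so that at generation time a desired length can be requested.

```python
import random
from dataclasses import dataclass


@dataclass
class TimingConditioning:
    seconds_start: float  # where the chunk begins within the full track (assumed name)
    seconds_total: float  # duration of the full track (assumed name)


def sample_chunk(track_seconds, window_seconds, rng):
    """Pick a random fixed-size training window and return its timing values."""
    # A track shorter than the window always starts at 0 (it would be padded).
    latest_start = max(0.0, track_seconds - window_seconds)
    start = rng.uniform(0.0, latest_start)
    return TimingConditioning(seconds_start=start, seconds_total=track_seconds)


def inference_conditioning(requested_seconds):
    """At generation time, ask for audio that starts at 0 and lasts requested_seconds."""
    return TimingConditioning(seconds_start=0.0, seconds_total=requested_seconds)
```

For example, `sample_chunk(180.0, 30.0, random.Random(0))` tags a random 30-second window of a three-minute track, while `inference_conditioning(20.0)` describes a request for a 20-second clip. In a real model these two numbers would be embedded and fed to the diffusion network alongside the text conditioning; that step is omitted here.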
