AI can now generate CD-quality music from text, and it’s only getting better

A 3D illustration of a toy robot singing.

(Credit: Getty Images)

Imagine typing “dramatic intro music” and hearing a soaring symphony, or writing “creepy footsteps” and getting high-quality sound effects. That is the promise of Stable Audio, a text-to-audio AI model announced Wednesday by Stability AI that can synthesize music or sounds from written descriptions. Before long, similar technology may challenge musicians for their jobs.

If you’ll recall, Stability AI is the company that helped fund the creation of Stable Diffusion, a latent diffusion image synthesis model released in August 2022. Not content to limit itself to generating images, the company branched out into audio by backing Harmonai, an AI lab that launched music generator Dance Diffusion in September.

Now Stability and Harmonai want to break into commercial AI audio production with Stable Audio. Judging by production samples, it looks like a significant audio quality upgrade from earlier AI audio generators we’ve seen.

