
Google researchers have made an AI that may generate minutes-long musical items from textual content prompts, and might even rework a whistled or hummed melody into different devices, much like how methods like DALL-E generate pictures from written prompts (by way of TechCrunch). The mannequin is known as MusicLM, and whilst you can’t mess around with it for your self, the corporate has uploaded a bunch of samples that it produced utilizing the mannequin.
The examples are spectacular. There are 30-second snippets of what sound like precise songs created from paragraph-long descriptions that prescribe a style, vibe, and even particular devices, in addition to five-minute-long items generated from one or two phrases like “melodic techno.” Maybe my favourite is a demo of “story…
Proceed studying…