How Meta’s AI Generates Music Based on a Reference Melody | by Max Hilsdorf

[ad_1]

MusicGen, analyzed

On June 13th, 2023, Meta (formerly Facebook) made waves in the music and AI communities with the release of their generative music model, MusicGen. This model not only surpasses Google’s MusicLM, which was launched earlier this year, in terms of capabilities but is also trained on licensed music data and open-sourced for non-commercial use.

This means that you can not only read the research paper or listen to demos but also copy their code from GitHub or experiment with the model in a web app on HuggingFace.

In addition to generating audio from a text prompt, MusicGen can also generate music based on a given reference melody, a feature known as melody conditioning. In this blog post, I will demonstrate how Meta implemented this useful and fascinating functionality into their model. But before we delve into that, let’s first understand how melody conditioning works in practice.

Base Track

The following is a short electronic music snippet that I produced for this article. It features electronic drums, two dominant 808 bass and two syncopated synths. When listening to it, try to identify the “main melody” of the track.