MUSDB18 Dataset

We chose the MUSDB18 dataset as the starting point for our stem separation project. The dataset consists of 150 songs, split into a training subset of 100 songs and a testing subset of 50 songs, for approximately 10 hours of music across multiple genres. For each song, we have a .wav file of the full song, which is a mixture of its constituent stems, as well as .wav files for the individual vocals, bass, drums, and "other" stems. The "other" stem is a mixture of all remaining instrumentation in the song. This dataset has been extremely useful for our purposes: we were able to start with the simple case of separating two stems and work towards separating three. Moreover, the individual stems serve as the ground truth for evaluating our source separation algorithms.

Each .wav file can be read into MATLAB, where it is represented by two vectors corresponding to the left and right channels of the file. Each song is thus composed of five stereo streams: the mixture, the drums, the bass, the vocals, and the rest of the accompaniment. Each stereo stream is sampled at 44.1 kHz. The mixture is encoded separately as AAC at 44.1 kHz, so there can be a small difference between the sum of the individual sources and the full song mixture. For this project, we consider only the vocals, bass, and drums stems. By summing these stems together, we create our own mixtures to feed into our source separation algorithms.
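
The stem-summing step described above can be sketched as follows. This is a minimal illustration in Python rather than the MATLAB code used for the project, and it assumes 16-bit stereo PCM .wav files; the function names `read_stereo_wav` and `mix_stems` and the file paths in the comments are hypothetical.

```python
import struct
import wave


def read_stereo_wav(path):
    """Read a stereo .wav file into (sample_rate, left, right).

    Assumption: the file holds 16-bit PCM samples. Samples are scaled
    to floats in [-1.0, 1.0).
    """
    with wave.open(path, "rb") as wav_file:
        if wav_file.getnchannels() != 2 or wav_file.getsampwidth() != 2:
            raise ValueError("this sketch assumes 16-bit stereo PCM")
        sample_rate = wav_file.getframerate()
        raw = wav_file.readframes(wav_file.getnframes())
    # Samples are interleaved as L0, R0, L1, R1, ...
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    left = [s / 32768.0 for s in samples[0::2]]
    right = [s / 32768.0 for s in samples[1::2]]
    return sample_rate, left, right


def mix_stems(stems):
    """Sum stems channel by channel.

    Each stem is a (left, right) pair of equal-rate sample lists. The
    result is truncated to the shortest stem. The sum is not rescaled,
    so it may exceed the [-1.0, 1.0] range.
    """
    n = min(len(left) for left, _ in stems)
    mixed_left = [sum(left[i] for left, _ in stems) for i in range(n)]
    mixed_right = [sum(right[i] for _, right in stems) for i in range(n)]
    return mixed_left, mixed_right


# Hypothetical usage (paths are placeholders, not the project's files):
# _, v_left, v_right = read_stereo_wav("vocals.wav")
# _, b_left, b_right = read_stereo_wav("bass.wav")
# _, d_left, d_right = read_stereo_wav("drums.wav")
# mix_left, mix_right = mix_stems(
#     [(v_left, v_right), (b_left, b_right), (d_left, d_right)]
# )
```

Because the mixture here is built directly from the three stems, the stems add up to it exactly, which is not guaranteed for the separately encoded full mixture.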

To evaluate our source separation algorithms, we focused on the vocals, bass, and drums mixture for six songs in the dataset: "Music Delta - Beatles", "Music Delta - Hendrix", "Music Delta - Rock", "BKS - Bulldozer", "Carlos Gonzalez - A Place For Us", and "Motor Tapes - Shore". We felt these samples were varied enough to make a good evaluation set. Attached below are the ground truth stems (vocals, bass, and drums) and the three-stem mixture for each of these songs, as used in our testing. For "BKS - Bulldozer", "Carlos Gonzalez - A Place For Us", and "Motor Tapes - Shore", we use a snippet of each song rather than the entire song to accelerate evaluation.
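
Cutting a snippet from a longer song comes down to converting times to sample indices at the 44.1 kHz sampling rate. The sketch below shows one way to do this in Python; the function name `cut_snippet` and the start time and duration in the comment are hypothetical, since the page does not state which portion of each song was used.

```python
def cut_snippet(left, right, sample_rate, start_s, duration_s):
    """Return the left/right samples for [start_s, start_s + duration_s)."""
    # Convert seconds to sample indices (e.g. 1 s = 44100 samples at 44.1 kHz).
    start = int(round(start_s * sample_rate))
    stop = start + int(round(duration_s * sample_rate))
    return left[start:stop], right[start:stop]


# Hypothetical usage: keep 22 seconds starting 30 seconds into the song.
# snippet_left, snippet_right = cut_snippet(left, right, 44100, 30.0, 22.0)
```

Applying the same start time and duration to the mixture and to every stem keeps the snippets aligned, so the stems remain valid ground truth for the shortened mixture.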

Audio samples, Music Delta - Beatles (00:36 each): Mix, Vocals, Bass, Drums

Figure 1: Spectrogram and audio samples for Music Delta - Beatles

Audio samples, Music Delta - Hendrix (00:20 each): Mix, Vocals, Bass, Drums

Figure 2: Spectrogram and audio samples for Music Delta - Hendrix

Audio samples, Music Delta - Rock (00:13 each): Mix, Vocals, Bass, Drums

Figure 3: Spectrogram and audio samples for Music Delta - Rock

Audio samples, BKS - Bulldozer (00:22 each): Mix, Vocals, Bass, Drums

Figure 4: Spectrogram and audio samples for BKS - Bulldozer

Audio samples, Carlos Gonzalez - A Place For Us (00:23 each): Mix, Vocals, Bass, Drums

Figure 5: Spectrogram and audio samples for Carlos Gonzalez - A Place For Us

Audio samples, Motor Tapes - Shore (00:34 each): Mix, Vocals, Bass, Drums

Figure 6: Spectrogram and audio samples for Motor Tapes - Shore

EECS351 Stem Separation

©2023 by EECS351 Stem Separation. Proudly created with Wix.com
