Authors: Ostermann, Fabian
Vatolkin, Igor
Ebeling, Martin
Title: AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks
Language (ISO): en
Abstract: We present a new dataset of 3000 artificial music tracks with rich annotations based on real instrument samples and generated by algorithmic composition with respect to music theory. Our collection provides ground truth onset information and has several advantages compared to many available datasets. It can be used to compare and optimize algorithms for various music information retrieval tasks like music segmentation, instrument recognition, source separation, onset detection, key and chord recognition, or tempo estimation. As the audio is perfectly aligned to original MIDIs, all annotations (onsets, pitches, instruments, keys, tempos, chords, beats, and segment boundaries) are absolutely precise. Because of that, specific scenarios can be addressed, for instance, detection of segment boundaries with instrument and key change only, or onset detection only in tracks with drums and slow tempo. This allows for the exhaustive evaluation and identification of individual weak points of algorithms. In contrast to datasets with commercial music, all audio tracks are freely available, allowing for extraction of own audio features. All music pieces are stored as single instrument audio tracks and a mix track, so that different augmentations and DSP effects can be applied to extend training sets and create individual mixes, e.g., for deep neural networks. In three case studies, we show how different algorithms and neural network models can be analyzed and compared for music segmentation, instrument recognition, and onset detection. In future, the dataset can be easily extended under consideration of specific demands to the composition process.
Subject Headings: Artificial music dataset
Multitrack audio mixes
Algorithmic composition
Music segmentation
Instrument recognition
Source separation
Onset detection
Tempo estimation
Chord detection
Subject Headings (RSWK): Datensatz
Komposition <Musik>
Issue Date: 2023-03-23
Rights link:
Appears in Collections:LS 11

Files in This Item:
File Description SizeFormat 
s13636-023-00278-7.pdfDNB1.7 MBAdobe PDFView/Open

This item is protected by original copyright

This item is licensed under a Creative Commons License Creative Commons