MUSIC SKETCHNET: CONTROLLABLE MUSIC GENERATION VIA FACTORIZED REPRESENTATIONS OF PITCH AND RHYTHM

Ke Chen, Cheng I. Wang, Taylor Berg-Kirkpatrick, Shlomo Dubnov

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

27 Scopus citations

Abstract

Drawing an analogy with automatic image completion systems, we propose Music SketchNet, a neural network framework that allows users to specify partial musical ideas guiding automatic music generation. We focus on generating the missing measures in incomplete monophonic musical pieces, conditioned on surrounding context, and optionally guided by user-specified pitch and rhythm snippets. First, we introduce SketchVAE, a novel variational autoencoder that explicitly factorizes rhythm and pitch contour to form the basis of our proposed model. Then we introduce two discriminative architectures, SketchInpainter and SketchConnector, that in conjunction perform the guided music completion, filling in representations for the missing measures conditioned on surrounding context and user-specified snippets. We evaluate SketchNet on a standard dataset of Irish folk music and compare with models from recent works. When used for music completion, our approach outperforms the state-of-the-art both in terms of objective metrics and subjective listening tests. Finally, we demonstrate that our model can successfully incorporate user-specified snippets during the generation process.

Original languageEnglish
Title of host publicationProceedings of the 21st International Society for Music Information Retrieval Conference, ISMIR 2020
EditorsJulie Cumming, Jin Ha Lee, Brian McFee, Markus Schedl, Johanna Devaney, Johanna Devaney, Cory McKay, Eva Zangerle, Timothy de Reuse
PublisherInternational Society for Music Information Retrieval
Pages77-84
Number of pages8
ISBN (Electronic)9780981353708
StatePublished - 1 Jan 2020
Externally publishedYes
Event21st International Society for Music Information Retrieval Conference, ISMIR 2020 - Virtual, Online, Canada
Duration: 11 Oct 202016 Oct 2020

Publication series

NameProceedings of the 21st International Society for Music Information Retrieval Conference, ISMIR 2020

Conference

Conference21st International Society for Music Information Retrieval Conference, ISMIR 2020
Country/TerritoryCanada
CityVirtual, Online
Period11/10/2016/10/20

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint

Dive into the research topics of 'MUSIC SKETCHNET: CONTROLLABLE MUSIC GENERATION VIA FACTORIZED REPRESENTATIONS OF PITCH AND RHYTHM'. Together they form a unique fingerprint.

Cite this