e-space
Manchester Metropolitan University's Research Repository

    Style-based drum synthesis with GAN inversion

    Drysdale, Jake, Tomczak, Maciej and Hockman, Jason (2021) Style-based drum synthesis with GAN inversion. In: 22nd International Society of Music Information Retrieval Conference, 07 November 2021 - 12 November 2021, Online.

    [img]
    Preview
    Published Version
    Available under License Creative Commons Attribution.

    Download (1MB) | Preview

    Abstract

    Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that generate audio from hand-designed components such as oscillators and wavetables. For a neural audio synthesizer to be applicable to music creation, meaningful control over the output is essential. This paper provides an overview of an unsupervised approach to deriving useful feature controls learned by a generative model. A system for generation and transformation of drum samples using a style-based generative adversarial network (GAN) is proposed. The system provides functional control of audio style features, based on principal component analysis (PCA) applied to the intermediate latent space. Additionally, we propose the use of an encoder trained to invert input drums back to the latent space of the pre-trained GAN. We experiment with three modes of control and provide audio results on a supporting website.

    Impact and Reach

    Statistics

    Activity Overview
    6 month trend
    9Downloads
    6 month trend
    18Hits

    Additional statistics for this dataset are available via IRStats2.

    Repository staff only

    Edit record Edit record