Skip to main navigation Skip to search Skip to main content

GENERATIVE LATENT SPACES FOR NEURAL SYNTHESIS OF AUDIO TEXTURES

Research output: Contribution to journalConference articlepeer-review

Abstract

This paper investigates the synthesis of audio textures and the structure of generative latent spaces using Variational Autoencoders (VAEs) within two paradigms of neural audio synthesis: DSP-inspired and data-driven approaches. For each paradigm, we propose VAE-based frameworks that allow fine-grained temporal control. We introduce datasets across three categories of environmental sounds to support our investigations. We evaluate and compare the models’ reconstruction performance using objective metrics, and investigate their generative capabilities and latent space structure through latent space interpolations.

Original languageEnglish
Pages (from-to)419-426
Number of pages8
JournalProceedings of the International Conference on Digital Audio Effects, DAFx
Publication statusPublished - 2025
Event28th International Conference on Digital Audio Effects, DAFx 2025 - Ancona, Italy
Duration: 2 Sep 20255 Sep 2025

Fingerprint

Dive into the research topics of 'GENERATIVE LATENT SPACES FOR NEURAL SYNTHESIS OF AUDIO TEXTURES'. Together they form a unique fingerprint.

Cite this