Abstract
This paper investigates the synthesis of audio textures and the structure of generative latent spaces using Variational Autoencoders (VAEs) within two paradigms of neural audio synthesis: DSP-inspired and data-driven approaches. For each paradigm, we propose VAE-based frameworks that allow fine-grained temporal control. We introduce datasets across three categories of environmental sounds to support our investigations. We evaluate and compare the models’ reconstruction performance using objective metrics, and investigate their generative capabilities and latent space structure through latent space interpolations.
| Original language | English |
|---|---|
| Pages (from-to) | 419-426 |
| Number of pages | 8 |
| Journal | Proceedings of the International Conference on Digital Audio Effects, DAFx |
| Publication status | Published - 2025 |
| Event | 28th International Conference on Digital Audio Effects, DAFx 2025 - Ancona, Italy Duration: 2 Sep 2025 → 5 Sep 2025 |
Fingerprint
Dive into the research topics of 'GENERATIVE LATENT SPACES FOR NEURAL SYNTHESIS OF AUDIO TEXTURES'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver