This repo contains the official PyTorch implementation of BRAVE, a low-latency audio variational autoencoder for instrumental performance. It also implements all of the other models evaluated in the paper. Check the `evaluation` directory for instructions on replicating the paper's results.
We use the `acids-rave` package for preprocessing the audio datasets and training the models.
```bash
pip install h5py acids-rave==2.3  # may work with lower versions too
conda install ffmpeg
git clone https://github.com/fcaspe/BRAVE
cd BRAVE
```
We use the same `rave preprocess` tool as RAVE for dataset preparation, so RAVE datasets also work with this repo's models. Check RAVE's documentation for details on dataset preparation.
```bash
rave preprocess --input_path /audio/folder --output_path path/to/preprocessed/dataset/ --channels X
```
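The `--channels` value should match your audio files. As a quick illustration (not part of the repo's tooling), a stdlib-only Python snippet to check a WAV file's channel count before preprocessing:

```python
import wave

def wav_channels(path: str) -> int:
    """Return the number of audio channels in a WAV file."""
    with wave.open(path, "rb") as wf:
        return wf.getnchannels()

# Example: pick the --channels value from one of your files
# (hypothetical path shown).
# print(wav_channels("/audio/folder/example.wav"))
```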
We use the same `rave train` CLI for training. Make sure to pass `--config` a path to one of the `.gin` configs provided in this repo. For instance, to train BRAVE:
```bash
rave train --config ./configs/brave.gin --name my_brave_run --db_path path/to/preprocessed/dataset/
```
The BRAVE plugin can run BRAVE models at < 10 ms latency and low jitter (~3 ms).
Use the `export_brave_plugin.py` utility to export a trained model. It requires a BRAVE checkpoint (`.ckpt`) created with `rave train`; it does not work with models exported to TorchScript (`.ts`).
```bash
python ./scripts/export_brave_plugin.py --model path/to/model_checkpoint.ckpt --output_path ./exported_model.h5
```
NOTE: BRAVE works best when run at its original sampling rate. For best results, run the plugin at the same sample rate as the data used to train the model.
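The exported plugin model is an HDF5 file, and `h5py` is already listed in the install requirements, so you can peek inside an export. A minimal sketch (the dataset names are whatever the exporter wrote; none are assumed here):

```python
import h5py

def list_h5_contents(path: str) -> list[str]:
    """List every group/dataset path stored in an HDF5 file."""
    names: list[str] = []
    with h5py.File(path, "r") as f:
        # visit() walks the file tree and calls the callback
        # with each object's path.
        f.visit(names.append)
    return names

# e.g. list_h5_contents("./exported_model.h5")
```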
BRAVE is compatible with many creative coding tools and plugins that use RAVE models. You can export a BRAVE model to work with some great tools created by the community, such as:
- nn~ for Max-MSP & PureData
- SuperCollider UGen
- IRCAM's RAVE VST
- And probably some more
Please note that these may show higher latency than the BRAVE plugin due to a different audio buffering strategy.
```bash
rave export --run path/to/model_checkpoint.ckpt
```
This will store a TorchScript (`.ts`) model next to the checkpoint file, which you can then load in your application of choice.
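TorchScript files use a zip-based container format, so a quick stdlib-only sanity check (hypothetical helper, not part of the repo) can confirm an export was not truncated:

```python
import zipfile

def looks_like_torchscript(path: str) -> bool:
    """TorchScript (.ts) files are zip-based containers, so a
    truncated or failed export will typically fail this check."""
    return zipfile.is_zipfile(path)
```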
If you find this work useful, please consider citing our paper:
```bibtex
@article{caspe2025designing,
  title={{Designing Neural Synthesizers for Low-Latency Interaction}},
  author={Caspe, Franco and Shier, Jordie and Sandler, Mark and Saitis, Charis and McPherson, Andrew},
  journal={Journal of the Audio Engineering Society},
  year={2025}
}
```