Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning

Dependencies

The scripts were ran in Python 3.13.3 with NumPy 2.2.6, SciPy 1.16.0, PyTorch 2.7.0, Scikit-Learn 1.6.1, Matplotlib 3.10.1, seaborn 0.13.2, Numba 0.61.2 and tqdm 4.67.1.

Song preprocessing (optional; see below) requires Autoencoded Vocal Analysis 0.3.1 (AVA) and Joblib 1.5.0.

Instructions

This repository contains the code for simulating the models and generating the core figures in the paper Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning (bioRxiv).

The mathematical details are described in the paper. src contains the source code for the networks, the reinforcement learning environment, the sparse coding model, and other helper functions.

For the basic training and simulations of the models, check out examples in simulations, which also contains the code to generate the panels such as neuronal firing rates, and joint distributions of mean rates between correct and perturbed singing cases in the paper. Notice that the networks themselves are implemented to take any time series input that matches the dimensions, but the simulations to reproduce the figures in the paper uses sparse embeddings of real spectrograms of adult songs. To generate this embedding, follow the steps in realistic_auditory_processing or directly download the generated embeddings.

further_analysis contains the code to run and visualize the analysis of the learned recurrent weights, as well as the reinforcement learning experiments. robustness_tests contains the code to plot the cancellation quality over different parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
experiment_data		experiment_data
further_analysis		further_analysis
realistic_auditory_processing		realistic_auditory_processing
robustness_tests		robustness_tests
simulations		simulations
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning

Dependencies

Instructions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning

Dependencies

Instructions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages