This repository contains all code necessary to reproduce our experiments from the main body of the paper "Self-Expanding Neural Networks". There are two codebases, one for CNN experiments and one for MLP experiments. They are located in the senn_cnn and senn_mlp folders respectively.
To set up the CNN codebase, navigate to the senn_cnn folder and proceed with the requirements and environment variables as described below.
The requirements can be installed using PDM, a Python package manager. To do so, first install PDM, then run
pdm install
to install the requirements. This will set up a virtual environment with all dependencies.
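For example (installing PDM via pip is one option; see the PDM documentation for alternatives):

```sh
pip install --user pdm   # one way to obtain PDM
cd senn_cnn
pdm install              # creates a virtual environment with all dependencies
```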
To ease reproduction, we also provide a Dockerfile, together with a docker-compose service that sets up the container. To use Docker, first install Docker and docker-compose, then run
docker compose build
to build the container, and
docker compose up -d
to start it. This starts a container with all dependencies installed.
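Putting these steps together:

```sh
cd senn_cnn
docker compose build   # build the image with all dependencies
docker compose up -d   # start the container in the background
```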
The following environment variables are used:
- CUDA_VISIBLE_DEVICES: Specifies which GPU to use.
- DATASETS_ROOT_DIR: Specifies the root directory for datasets.
- WANDB_DIR: Specifies the root directory for wandb logs.
- RTPT_INITIALS: Specifies the initials to use for RTPT.
- WANDB_API_KEY: Specifies the API key to use for wandb; only necessary if you want to log to wandb.
If using Docker, these variables may be set in the docker-compose.yml file, or by creating a .env file in the root directory. If using a .env file, create it before running docker compose up -d.
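For example, a .env file might look like this (all values are placeholders):

```sh
CUDA_VISIBLE_DEVICES=0
DATASETS_ROOT_DIR=/path/to/datasets
WANDB_DIR=/path/to/wandb
RTPT_INITIALS=AB
WANDB_API_KEY=...   # only needed when logging to wandb
```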
To run an experiment, simply run the corresponding script. If using Docker, this may be done by running
docker compose exec -e CUDA_VISIBLE_DEVICES -e JAX_PLATFORMS main python experiments/{experiment}.py
where {experiment} is one of the following (a concrete invocation is shown after the list):
- senn_cifar10_manycycle
- senn_cifar10_onecycle
- transfer_cifar10_pretrain
- transfer_tinyimagenet_fixed
- transfer_tinyimagenet_senn
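For example, to run the CIFAR-10 one-cycle experiment inside the container:

```sh
docker compose exec -e CUDA_VISIBLE_DEVICES -e JAX_PLATFORMS main \
    python experiments/senn_cifar10_onecycle.py
```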
If using PDM without Docker, activate the virtual environment, then run
python experiments/{experiment}.py
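For example, one way to do this (assuming PDM's default virtual environment; `pdm run` avoids activating it at all):

```sh
eval "$(pdm venv activate)"                    # activate the PDM-managed venv
python experiments/senn_cifar10_onecycle.py

# or, without activating:
pdm run python experiments/senn_cifar10_onecycle.py
```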
The transfer learning experiments save a model checkpoint after training. This checkpoint is then loaded and used to initialize the model for the transfer learning task. In Docker, the checkpoint directory is /senn/orbax/pretrained/final. To alter the checkpoint directory, change the CHECKPOINT_DIR variable in each experiment file.
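As a rough sketch of the pattern (the experiment files may use a different Orbax API surface; this is illustrative only):

```python
import orbax.checkpoint as ocp

# Path used in the Docker setup; change CHECKPOINT_DIR in the
# experiment files to relocate it.
CHECKPOINT_DIR = "/senn/orbax/pretrained/final"

checkpointer = ocp.PyTreeCheckpointer()
# after pretraining:  checkpointer.save(CHECKPOINT_DIR, params)
# before transfer:    params = checkpointer.restore(CHECKPOINT_DIR)
```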
The hyperparameters and tasks may be varied by editing the settings in the corresponding experiment file.
The random seeds used for model initialization and training can be altered in the experiment script.
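For instance, seed handling in a JAX experiment script typically looks something like the following sketch (variable names here are illustrative, not the exact ones used in the experiment files):

```python
import jax

seed = 0  # edit this value to vary the run
rng = jax.random.PRNGKey(seed)
# separate randomness for initialization and training
init_rng, train_rng = jax.random.split(rng)
```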
You can set up the MLP codebase by navigating to the senn_mlp folder and proceeding with the requirements and environment variables as described below.
We provide two methods to quickly install all dependencies required for our experiments.
To ease reproduction, we provide a Dockerfile and scripts with which to use it (a combined example follows the list below).
- To build the container, run
sh build.sh
in the main ("senn") directory.
- To run the container on GPU 0, execute the command
./use 0
where this argument may be omitted if using the CPU.
- To start a container with port forwarding for tensorboard, use
sh tboard.sh
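A typical session might therefore look like this (assuming the scripts sit at the top of the MLP codebase):

```sh
cd senn_mlp     # the main ("senn") directory
sh build.sh     # build the Docker image
./use 0         # run the container on GPU 0; omit the argument for CPU
```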
It is of course possible to install dependencies with pip, without using Docker:
- Install JAX following the instructions in the official repository, e.g. for CUDA as of May 2023:
pip install --upgrade "jax[cuda11_local]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
- Install PyTorch and Torchvision following the instructions on the official website, e.g. as of May 2023:
pip install --upgrade torch torchvision --index-url https://download.pytorch.org/whl/cu118
Make sure to install a version whose CUDA requirements are compatible with JAX.
- Install the remaining requirements from requirements.txt:
pip install -r requirements.txt
If you are not using Docker, adjust the "checkpoints", "datasets" and "logs" folders in config.yaml to your liking. To
run an experiment, simply execute the corresponding script, e.g. for experiment 4:
python experiment4.py --name my_experiment
The "--name my_experiment" argument specifies that tensorboard will store any logs under that name.
Various metrics, such as training/validation loss and neuron count, will be logged during training. If using Docker, these metrics may be viewed by starting a container with
sh tboard.sh
and then running the command
tensorboard --logdir /logs
inside it. If Docker is not used, then simply run
tensorboard --logdir ./mylogs
replacing "./mylogs" with the directory chosen in "config.yaml" for "meta:logdir".
The hyperparameters and tasks may be adjusted in the "default_config.yaml" in the folder corresponding to the experiment e.g. for experiment 4 one would adjust "senn/experiment4/default_config.yaml".
The random seed used during an experiment may be varied by altering "meta:seed", or by passing the command-line argument --seed, e.g.
python experiment4.py --name my_experiment --seed 2
In experiment 4 there is an additional replication-relevant setting, "data:defaults:N", initially set to 4800. This specifies the number of examples from each class included in the subset of MNIST trained on; for example, N=6000 would result in training on the full 60000 images.
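For reference, the corresponding excerpt of "senn/experiment4/default_config.yaml" presumably looks like this (illustrative layout):

```yaml
data:
  defaults:
    N: 4800   # examples per class; N=6000 uses the full 60000-image training set
```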