Skip to content

Latest commit

 

History

History
116 lines (74 loc) · 9.43 KB

README.md

File metadata and controls

116 lines (74 loc) · 9.43 KB

MuE

This package provides a toolbox for building generative and predictive probabilistic models of biological sequences, as described in Weinstein and Marks (2021). It is implemented in the probabilistic programming language Edward2, with a Tensorflow backend. Note that an implementation of the MuE in Pyro is also available as part of the Pyro package, see docs and examples.

Installation

To install the package, create a new python 3.7 virtual environment (eg. using conda) and run:

pip install "git+https://github.com/debbiemarkslab/MuE.git#egg=MuE[extras]" --use-feature=2020-resolver

pip install "git+https://github.com/debbiemarkslab/edward2.git#egg=edward2"

This shouldn't take more than a few minutes. You can find out more about the package requirements in the setup.py file. We've provided a fork of the Edward2 repo since it does not yet have a stable release, but you can also install the newest version from the project repo https://github.com/google/edward2 to get the latest features.

To run the models at large scale, you will need to make sure you have access to a GPU with CUDA installed. See https://www.tensorflow.org/install/gpu for further support.

Demo MuE observation models

To run the example models, first clone this MuE repo and navigate to the models directory. Each model (FactorMuE.py and RegressMuE.py) is configured with a separate config file, such as examples/factor_config.cfg (for FactorMuE.py) and examples/regress_config.cfg (for RegressMuE.py). The config file controls the dataset, hyperparameters, training time, etc. Descriptions of all the options can be found in the config files themselves. The output of each run, including plots of point estimates of key parameters, is currently configured to be saved in the subdirectory examples/logs. You can inspect the ELBO optimization curve by running tenso