GitHub - Mohnishi/KSNR: Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning

Readme

This code is meant to reproduce the results found in Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning by Motoya Ohnishi, Isao Ishikawa, Kendall Lowrey, Masahiro Ikeda, Sham Kakade, and Yoshinobu Kawahara. Some experiments require external licence, namely, MuJoCo. Refer to Lyceum MuJoCo for instructions on how to use MuJoCo under Lyceum platform.

Setup & Install

This code has been tested on Ubuntu 18.04, but should also work on different platforms (MacOS, Windows, FreeBSD) if the instructions are adapted.

The process to bring up this repo is as follows:

Download and install Julia
Navigate to project and instantiate
Run

The following is an example of installing Julia for Ubuntu 18.04.

cd ~/Downloads
wget https://julialang-s3.julialang.org/bin/linux/x64/1.5/julia-1.5.3-linux-x86_64.tar.gz
tar xvf julia-1.5.3-linux-x86_64.tar.gz

# the following exports can be added to your bashrc.
export JULIA_BINDIR=~/Downloads/julia-1.5.3/bin
export PATH=$JULIA_BINDIR:$PATH
export JULIA_NUM_THREADS=12

cd $directory_you_extracted_code
julia

Once you start Julia, regardless of platform, the following instructions may proceed:

julia> ]
(@v1.5) pkg> registry add https://github.com/Lyceum/LyceumRegistry     # add Lyceum registry
(@v1.5) pkg> activate .                        # activates this project
(KSNR) pkg> instantiate   # the built in package manager downloads, installs dependences
(KSNR) pkg> ctrl-c

julia> executescripts = false                      # to use our data to plot;   executescripts = true  is for running the algorithms
julia> include("main_1.jl")                      # to run the first experiment and/or plot the data  (limit-cycle experiment);  include("main_2.jl"), include("main_3.jl"), include("main_4.jl") does each experiment.

Notes

The results in the paper were generated with Julia 1.5.3, with 12 Julia threads. This is critical to reproducibility, but not necessary for running the included algorithm; one should adapt these settings to their compute.

Also, one may need to restart Julia to run experiments sequentially. To exit julia, do

julia> exit()

Next time you start Julia, you do not need to do instantiate but only activate.

Code Structure

.
├── log                    # Data store
│   ├── data1.jlso
│   ├── data2.jlso
│   └── ...
├── main_1.jl
├── main_2.jl
├── main_3.jl
├── main_4.jl
├── Manifest.toml          # Julia Manifest file for all dependencies
├── models           # Analytical dynamical system models
│   ├── cartpole.jl
│   └── singleint.jl
├── mujoco_models                    # MuJoCo models (require MuJoCo)
│   ├── cartpole_stable.jl
│   ├── cartpole_stable.xml
│   ├── walker2d.jl
│   ├── walker2d.xml
│   ├── reward.jl
│   └── common
│       ├── materials.xml
│       ├── skybox.xml
│       └── visual.xml
├── planner           # Heuristic planner algorithms
│   ├── MPPIClamp.jl
│   ├── PolicySelect.jl
│   ├── PolicySelect-GT.jl
│   └── PolicySelect-SP.jl
├── plot
├── Project.toml           # Julia Project file for top level dependencies
├── README.md              # This file
├── scripts                # Environment Hyper-Parameters and configuration/ Running
│   ├── cartpole_sim.jl
│   ├── learning.jl
│   ├── learning_gt.jl
│   ├── learning_run.jl
│   ├── singleint_sim.jl
│   └── walker2d_sim.jl
└── utils                  # Algorithm and support code
    ├── algorithm.jl
    ├── learned_env.jl
    ├── rff.jl
    └── weightmat.jl

Note walker2d is from OpenAI Gym and materials, skybox, visual, and cartpole are from DeepMind Control Suite

Code Maintenance

The codes are maintained by the authors of Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning (arXiv). The project page can be found here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Readme

Setup & Install

Notes

Code Structure

Code Maintenance

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
models		models
mujoco_models		mujoco_models
planner		planner
scripts		scripts
utils		utils
LICENSE		LICENSE
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md
main_1.jl		main_1.jl
main_2.jl		main_2.jl
main_3.jl		main_3.jl
main_4.jl		main_4.jl

License

Mohnishi/KSNR

Folders and files

Latest commit

History

Repository files navigation

Readme

Setup & Install

Notes

Code Structure

Code Maintenance

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages