Skip to content

Latest commit

 

History

History
50 lines (34 loc) · 1.65 KB

README.md

File metadata and controls

50 lines (34 loc) · 1.65 KB

notebook_helpers

Scripts for working with deployable, reproducible ipython notebooks.

Overview

Serve private IPython notebooks and ensure reproducibility, in two acts.

I. Development: Strip outputs from notebook before committing (using pre-commit hook)

II. Deployment: Re-run notebook logic and generate html on display server (using rake)

Development Setup

You'll wish to use this directory as your workspace. That is, when creating a new notebook, ipython notebook from within this directory, so that notebooks are stored here.

Run rake setup_pre_commit_hook in this directory, on your dev machine, to enable commit-time output checks. These checks will ensure that all notebooks checked in contain only code, not data.

Deployment Setup

To deploy, copy this directory to your server, and then run rake build_notebooks, on your server as well. This will re-run all notebook logic, and create rendered html files in a nbviewer subdir of the deployed directory. Generated html files will be cached between runs under /tmp/nbviewer. You can then serve the nbviewer subdirectory statically to display the built notebooks.

Additional Rakefile Directions

Each ipython notebook may have an accompanying rakefile with a matching name, e.g. df.ipynb and df.rake. That rakefile should declare a task :deps, which will be run to fetch any dependencies for the notebook.

file :some_data do
    sh "wget go/some_data"
end

file :other_data do
    sh "impala -f query.sql -o other_data"
end

file :joined_data => [:some_data, :other_data] do
    sh "join some_data other_data > joined_data"
end

task :deps => :joined_data do
end