Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split workers image into job-specific images and configure Nomad to use the correct one for each job. #55

Closed
kurtwheeler opened this issue Sep 19, 2017 · 1 comment

Comments

@kurtwheeler
Copy link
Contributor

kurtwheeler commented Sep 19, 2017

Context

As we continue to create different Processors the ccdl/dr_workers Docker image will continue to grow to contain all the dependencies and (at least some of the data) the different jobs need. This makes for a very bulky image that takes a long time to build, upload, and download along with requiring a non-trivial amount of disk space to store.

Problem or idea

We should have a Docker image for every type of job we have. We should also have different Nomad job specs for each type of job, rather than having a single processor.nomad job spec for all of them.

Solution or next step

Split the workers Docker image into separate images for each job type and create Nomad job specifications for each job type. Note that some jobs may still be able to an image, for example the SCAN.UPC image should be usable by both Affymetrix and Illumina specific jobs types.

Finally, change data_refinery_common.message_queue.send_job function to specify job types directly to Nomad rather than specifying the job type via the job parameters.

@kurtwheeler kurtwheeler added this to the Create Nomad job specifications milestone Sep 19, 2017
@kurtwheeler kurtwheeler added backlog and removed next labels Jan 24, 2018
@kurtwheeler kurtwheeler changed the title Configure Data Refinery to send the correct job type Split workers image into job-specific images and configure Nomad to use the correct one for each job. Mar 16, 2018
@cgreene cgreene modified the milestones: Create Nomad job specifications, VJackieP3 - Autoimmune/rheumatic disease compendium Apr 8, 2018
@cgreene cgreene modified the milestones: VJackieP3 - Autoimmune/rheumatic disease compendium, Data Refinery Version 1 Apr 26, 2018
@jaclyn-taroni
Copy link
Member

We just talked about what an "RNA-seq" Docker image might contain: RSEM, Salmon, salmontools, tximport and for QC purposes FastQC, MultiQC

Just recording this conversation here, which is to say this is not a substitute for looking at the code for the processor!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants