-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Write batch correct results to scratch folder and delete pilot modules #309
Conversation
@aadamk, @jharenza suggested we update the module to avoid writing large files within the module, which subsequently get uploaded to GitHub. Module outputs have been problematic when cloning or syncing locally because they are being stored on the GitHub LFS. I have updated the two main code scripts to write output locally to the repo
|
just a hunch - when you wrote to scratch maybe you need to call them back in from scratch for additional scripts? |
@jharenza, I am calling them back from scratch. Works ok when run on the full datasets on EC2. The GA error with the contrast function here seems to suggest that there is a predictor variable in the matrix without levels |
@aadamk, @jharenza clarified both of you had agreed to exclude NBL because v11 currently does not have subtyping. Maybe the contrast error in both the CI subset and the complete datasets will go away in v12 following subtyping. I'll exclude GA checking NBL with an if statement using a CI environmental variable as we do in other modules. After v12, we can run without setting the CI environmental variable. |
hi @ewafula - yes, subtyping for nbl was moved over to pathology free text with plans to move it back in v12. as such, i commented out that code so that it could pass CI, though I your plan with env var sounds good to me. thank you. |
Purpose/implementation Section
What scientific question is your analysis addressing?
Module result files are too large and are now all written locally to the OpenPedCan-analysis repository scratch directory (
OpenPedCan-analysis/scratch/
)What was your approach?
update code to write results to the
scratch/
folderWhat GitHub issue does your pull request address?
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Is there anything that you want to discuss further?
NA
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
NA
Results
What types of results are included (e.g., table, figure)?
What is your summary of the results?
Results written locally to repo scratch folder
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.