Release 0.1.0 #90

fellen31 · 2024-04-22T08:55:54Z

This PR aims to make a pre-release of the pipeline, which is on the whole, functional.

There's still lots of things I would like to fix before a 1.0 release though. Including going through each subworkflow and updating/tidying it. At the same time, it would be good to have a version of the pipeline since some data will be generated with it before 1.0.

So there are a number of things this PR does not concern itself with, that I aim to adress in future releases:

General:

Outstanding bugs are not fixed.
Citations list is not updated.
Config does not use [‘ ‘].join(‘ ‘) everywhere.

Subworkflows:

Subworkflows are not always matching naming of config and/or output dir
Most sort and index processes that can be merged have not been so.
Most sort and concat indexes that can be merged have not been so.
The short_variant_calling process naming is bad and confusing, and the above also apply.
Inputs and outputs are not explained with // channel: [ val(meta), reads ]`
Specifically for the ALIGN_READS subworkflow, single_end will be added to meta when processing samplesheet instead of doing that here.
Specifically for the GENOME_ASSEMBLY workfow, the trio-calling is not well explained and hard to understand.

Modules:

Local modules have not been added to nf-core.
Local modules are not tested.
Local modules that use meta.id instead of prefix have not been fixed.
Specifically for the DIPCALL module it is awful (a deconstructed make script that could not be run well within a nextflow process), this will most likely be replaced with another software.
Specifically for the HIPHASE module I could not get functions working.

PR checklist

…ngle element channels

Full size test fails because only `BCFTOOLS_NORM_SINGLESAMPLE` and not `BCFTOOLS_NORM` was actually decomposing variants.

rannick · 2024-04-30T08:40:27Z

.github/workflows/ci.yml

    runs-on: ubuntu-latest
    strategy:
      matrix:
+        parameters:
+          - ""


A test with no parameters?

Yes, so to run the -profile test with no extra arguments.

rannick · 2024-04-30T08:43:14Z

CHANGELOG.md


 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

-## v1.0dev - [date]
+## v0.1.0 - [date]


Update date once ready

rannick · 2024-04-30T08:49:30Z

conf/base.config

    }
    withLabel:process_high_memory {
-        memory = { check_max( 200.GB * task.attempt, 'memory' ) }
+        memory = { check_max( 218.GB * task.attempt, 'memory' ) }
    }
    withLabel:error_ignore {
        errorStrategy = 'ignore'


Does nallo processes need that much more than the standard compute? I would usually not meddle with the base config but specify per module in modules.config if some modules need extra.

Some need a lot to merge many samples, but I should keep this label 200.

rannick

Nice work!

jemten

Massive work @fellen31 ⭐
As you write in the PR description their are some workflows that would benefit from a refactoring. Also, quite a few of the local modules uses your private dockerhub, let's see if we can shift a few of those in the next release.

assets/schema_input.json

bin/split_bed_chunks.py

conf/base.config

conf/modules/structural_variant_calling.config

workflows/nallo.nf

jemten · 2024-05-02T07:35:28Z

workflows/nallo.nf

Would be good to rework some of the more deeply nested if statements.

Yes! General preference between using if/switch statements in workflow vs ext.when?

Here I we could branch out into more subworkflows. But otherwise, it's a balance... I kind of prefer the ext.when from an aesthetic point of view, but then I had people complaining that it wasn't clear from the code what was happening 😅

jemten · 2024-05-02T07:41:30Z

subworkflows/local/snv_annotation.nf

+
+    // Index and normalize single sample vcfs
+    BCFTOOLS_INDEX_SINGLESAMPLE(ch_single_sample_vcf)
+    ss = ch_single_sample_vcf.join(BCFTOOLS_INDEX_SINGLESAMPLE.out.csi)


guessing that ss is "single sample" but it would be good to have a more explicit channel name.

jemten · 2024-05-02T07:51:49Z

subworkflows/local/short_variant_calling.nf

+    // Collect GVCFs
+    ch_snp_calls_gvcf = ch_snp_calls_gvcf.mix(DEEPVARIANT.out.gvcf)
+
+    // TODO: This only works with DeepVariant for now (remove PEPPER_MARGIN_DEEPVARIANT/Deeptrio?)


Co-authored-by: Anders Jemt <jemten@users.noreply.github.com>

* Add review suggestions + full test changes

jemten

Looks good to me! We can work on the remaining issues in coming PRs/releases

* Update CODEOWNERS Change the code-owners to the GitHub team. This way we can more easily change the team and not having to update the CODEOWNERS * Fixed org

* Update whatshap stats version to avoid ZeroDivisionError * Update release date

jemten

Looks good!

This reverts commit eda623d.

Release 0.1.0 (#90)

fellen31 and others added 30 commits May 27, 2023 13:59

Add methylation calls to bedmethyl

71069f5

Output whatshap stats, and add pipeline presets

2cd039f

Fix mosdepth not running for all samples

7eba378

Fix methylation sample mix

fa201fa

fix dipcall not working

a7bcfed

Add fastq and bam stats

bd7abd5

fix dipcall not working

e3eafd7

Rework modules and subworkflows to match new-found knowledge about si…

e8efba1

…ngle element channels

Added TRGT for pacbio-data

9fe7224

Fix repeat versions

bf8681a

Fix samtools index publishdirs

7f73dbe

Add back --secondary=no to minimap2 align

9fbe31c

Fix cramino phased and unphased

cb7d344

Update README.md

1a87410

Fix dipcall sample name in vcf

041ed25

Merge branch 'dev' of github.com:fellen31/skierfe into dev

850fb79

Fix trgt container adress

a69892b

try again to fix trgt container adress

8273624

remove TRVZ because it will create too many processes

9390bef

completely remove TRVZ

5547a00

update modules conf, missing output files, index bams and fix containers

b72d3f1

Fix minimap2 index always using default params

40bae3d

minimap index needs to be built separate for dipcall as well

0130ee0

tweak process requirements

643c92f

Improve whatshap phasing

852c550

Add hiphase

d5b92ba

add missing bcftools modules

aadb574

Add samplesheet, extra_gcvf and extra_snfs validation

7710a79

update hifiasm

9fde2d4

completely remove PED and rely on samplesheet

ee35a39

fellen31 added 6 commits April 24, 2024 11:48

Update docs/usage.md

62be3b0

Update docs/usage.md

791df89

Delete annotations directory (#97)

ad4f298

Delete assets/test_data directory (#99)

0e360cb

Update download_pipeline.yml (#96)

79a4113

Update snv_annotation.config (#98)

5173489

Full size test fails because only `BCFTOOLS_NORM_SINGLESAMPLE` and not `BCFTOOLS_NORM` was actually decomposing variants.

rannick reviewed Apr 30, 2024

View reviewed changes

rannick previously approved these changes Apr 30, 2024

View reviewed changes

jemten reviewed May 2, 2024

View reviewed changes

Update conf/modules/structural_variant_calling.config

c493253

Co-authored-by: Anders Jemt <jemten@users.noreply.github.com>

fellen31 dismissed rannick’s stale review via c493253 May 2, 2024 18:25

Apply suggestions from code review and full test issues (#107)

a423730

* Add review suggestions + full test changes

jemten previously approved these changes May 3, 2024

View reviewed changes

Update CODEOWNERS (#112)

148ad2c

* Update CODEOWNERS Change the code-owners to the GitHub team. This way we can more easily change the team and not having to update the CODEOWNERS * Fixed org

fellen31 dismissed jemten’s stale review via 148ad2c May 3, 2024 12:38

jemten previously approved these changes May 3, 2024

View reviewed changes

Update whatshap stats version to avoid ZeroDivisionError (#133)

1a86b88

* Update whatshap stats version to avoid ZeroDivisionError * Update release date

fellen31 dismissed jemten’s stale review via 1a86b88 May 6, 2024 13:37

fellen31 added 2 commits May 7, 2024 10:22

Update bcftools merge (#134)

aa874bb

Bump date in changelog

387f8da

jemten previously approved these changes May 8, 2024

View reviewed changes

Since sex is now numberic, update branching criteria

ce4aa1c

fellen31 dismissed jemten’s stale review via ce4aa1c May 8, 2024 09:19

Was fixed before, but was overwritten

fc7a779

jemten approved these changes May 8, 2024

View reviewed changes

fellen31 merged commit eda623d into master May 8, 2024
25 of 27 checks passed

fellen31 added a commit that referenced this pull request May 8, 2024

Revert "Release 0.1.0 (#90)"

6f3bc5c

This reverts commit eda623d.

fellen31 added a commit that referenced this pull request May 13, 2024

Merge pull request #137 from genomic-medicine-sweden/master

485a83b

Release 0.1.0 (#90)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release 0.1.0 #90

Release 0.1.0 #90

fellen31 commented Apr 22, 2024 •

edited

Loading

rannick Apr 30, 2024

fellen31 May 2, 2024

rannick Apr 30, 2024

rannick Apr 30, 2024

fellen31 May 2, 2024

rannick left a comment

jemten left a comment

jemten May 2, 2024

fellen31 May 3, 2024

jemten May 3, 2024

jemten May 2, 2024

jemten May 2, 2024

jemten left a comment

jemten left a comment

Release 0.1.0 #90

Release 0.1.0 #90

Conversation

fellen31 commented Apr 22, 2024 • edited Loading

PR checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rannick left a comment

Choose a reason for hiding this comment

jemten left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jemten left a comment

Choose a reason for hiding this comment

jemten left a comment

Choose a reason for hiding this comment

fellen31 commented Apr 22, 2024 •

edited

Loading