Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bugfix: is_unexpected_replicate data missing in QC_Report.xlsx #304

Merged
merged 6 commits into from
Jul 9, 2024

Conversation

jaamarks
Copy link
Collaborator

@jaamarks jaamarks commented Jun 25, 2024

This resolves the issue of missing data for the is_unexpected_replicate column in the SAMPLE_QC, SAMPLE_CONCORDANCE, and SUBJECT_QC sheets of QC_Report.xlsx.

The column name was changed from Unexpected Replicate to is_unexpected_replicate in commit 825c8a57e2003e8c5a55aded8759584f950752cc, causing blank entries due to the reindexing step in qc_report_tables.py. Reverting the column name to Unexpected Replicate fixes this issue.



Additionally

Refactoring + minor fixes related to the QC_Report.



Fixes #303

@jaamarks jaamarks force-pushed the issue-303-empty-qcreport-columns branch 3 times, most recently from 7acf086 to da34dd1 Compare July 8, 2024 16:55
jaamarks added 6 commits July 9, 2024 11:38
The `QC_Report.xlsx` was missing data for the `is_unexpected_replicate`
column in the `SAMPLE_QC`, `SAMPLE_CONCORDANCE`, and `SUBJECT_QC`
sheets. This issue arose from a column name change in commit
`825c8a57e2003e8c5a55aded8759584f950752cc`, where `Unexpected Replicate`
was renamed to `is_unexpected_replicate`.

The renaming caused blank entries due to the reindexing step in
`qc_report_tables.py`. During reindexing, `is_unexpected_replicate` was
not found in the DataFrame because of the earlier renaming, leading to
NaN values. Pandas docs state, "By default, values in the new index that
do not have corresponding records in the DataFrame are assigned NaN."

To resolve this, the column names have been reverted to their original
title, `Unexpected Replicate`.
- Renamed parameter `sample_qc_csv` to `subject_qc_csv` to better reflect its intended use.
…s Removed"

We noticed a discrepancy in the terminology we are using in Table 4a.
It uses "Expected Duplicates Removed", while other parts of the pipeline
uses "Expected Replicates". So we updated Table 4a to
"Expected Replicates Removed" for consistency throughout the pipeline.
@jaamarks jaamarks force-pushed the issue-303-empty-qcreport-columns branch from 070a6fb to f4fd99e Compare July 9, 2024 15:38
@jaamarks jaamarks merged commit 210e1b8 into default Jul 9, 2024
2 checks passed
@jaamarks jaamarks deleted the issue-303-empty-qcreport-columns branch July 9, 2024 16:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Empty is_unexpected_replicate column in QC_Report.xlsx
1 participant