-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[enhancement] how to obtain the number of cases in the summary statistics output for binary phenotypes? #348
Comments
Hi, I have made a note to output the number of cases/controls for BTs when using Cheers, |
I see, I did not know about option Will generate the following columns:
One drawback is definitely that the p-value is not expressed as -log10(p) anymore and so this makes retrieving |
I wanted to second @freeseek's request -- for directly typed genotypes (WGS, array) where there is per-SNP missingness, it is non-trivial to get the case and control numbers intersecting with the called genotypes post-hoc, so this would add value. |
Hi, Thank you both for your valuable feedback. The columns N_CASES and N_CONTROLS will be added when using Cheers, |
This is now available in v3.2.2 (released here). Cheers, |
When I run regenie step 2 with option
--bt
for binary phenotypes, the output contains the following columns:The
N
column contains the total number of samples in the association. However, it would be useful to also have the number of cases in the output table, as from the total number of samples and the number of cases you can compute the effective sample size which can be useful when running a fixed effect meta-analysis weighted by the sample size.I don't necessarily expect this to be a mandatory field in the output, but at least it would be nice to have an option to include this information. I often convert the output of regenie into GWAS-VCF using the following map:
But the GWAS-VCF format has also a
NC
field that could host the number of cases.Separately from the main request, would there be interest in allowing regenie to generate summary statistics outputs as GWAS-VCF files? If regenie had the option to output summary statistics as binary GWAS-VCF files, this could also allow keeping the statistics as 32-bits floats without having to round and lose the less significant digits.
The text was updated successfully, but these errors were encountered: