-
Notifications
You must be signed in to change notification settings - Fork 421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Samplesheet of sarek #1605
Comments
Hey! I believe the samplesheet error is a bit of a false flag, and the actual error is reported further down the log:
The genome you picked - |
I try with skip option but the problem still remain with this message: And also with this one: |
Ah I think you are missing the column |
Thank you. Seems that the problem was the lane column (I changed the ";" with "," using vim before running the pipeline). This file get an error probably due to the 4th column (I attached a preview of this file, can you please confirm this?) which I have removed naming the new file "prova.bed". Caused by: Command executed: bgzip --threads 1 -c prova.bed > prova.bed.gz cat <<-END_VERSIONS > versions.yml Command exit status: Command output: Command error: Work dir: Tip: you can replicate the issue by changing to the process work dir and entering the command |
Sorry I have wrongly closed my issue. I have reopened it. I hope this is not a problem |
yes you will need to sort the bedfile. You can use bedtools for this. Other things I have done in the past depending on your analysis is padding the bed file on each side (depending on your read length, 50-100bp or so). This could cause adjacent regions to overlap in which case you also need to merge them. Here are the commands I used in the past to prepare the bed file:
|
Thank you very much! According to your suggestion I have changed also the genome reference to the standard GRCh38 and now all the warning about panel of normals and dbsnp and know_indels were solved. |
|
@Poocee has your issue been solved? In that case I would close the issue |
Dear Friederike,
Yes, my code is running without any problems (at least at the moment ☺).
I want to thank you again for your promptness in replying and your availability.
Best regards,
Stefano
Da: Friederike Hanssen ***@***.***>
Inviato: giovedì 8 agosto 2024 12:20
A: nf-core/sarek ***@***.***>
Cc: Percio Stefano ***@***.***>; Mention ***@***.***>
Oggetto: Re: [nf-core/sarek] Samplesheet of sarek (Issue #1605)
@Poocee<https://github.com/Poocee> has your issue been solved? In that case I would close the issue
—
Reply to this email directly, view it on GitHub<#1605 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AW32NTLRZSSKGGH6KVYHGMLZQNA63AVCNFSM6AAAAABLUILK3SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENZVGQ3DQOJSHA>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
|
Dear Friederike, Pipeline completed with errors- Caused by: [mem_sam_pe] paired reads have different names: "LH00193:5:22CNKWLT3:7:1101:24587:1048", "LH00193:16:222TJ2LT4:7:1101208:1056" I attached you the log file |
Looks some issue with your fastq file. I found this: https://www.biostars.org/p/254155/ maybe it helps |
Unfortunately I have receive these files so I cannot go back up. I check your link but I didn't find any solution to my problem. In addition I don't understanf why the problem looks at only one sample and not to all my fastq files. |
In that case I would probably try to inspect the files with tools like
All samples are run in parallel and by distinct jobs. It discovered the issue for one sample (KQ25-1) and reported it. |
Dear Friederike,
I follow your suggestion and I have used seqkit to modify my data and resolve the problem.
Unfortunately, now I have another problem about only one sample that I not understand.
I attach the report about the error description. In addition, I don’t understand why the pipeline doesn’t procede even if only one sample gives a mistake (only fastp and fastqc reports are present).
Could you please help me.
Thankd in advance.
Best,
Stefano
Da: Friederike Hanssen ***@***.***>
Inviato: lunedì 12 agosto 2024 16:42
A: nf-core/sarek ***@***.***>
Cc: Percio Stefano ***@***.***>; Mention ***@***.***>
Oggetto: Re: [nf-core/sarek] Samplesheet of sarek (Issue #1605)
Unfortunately I have receive these files so I cannot go back up. I check your link but I didn't find any solution to my problem.
In that case I would probably try to inspect the files with tools like seqkit to make sure they are valid fastq files.
In addition I don't understanf why the problem looks at only one sample and not to all my fastq files.
It discovered the issue for one sample (KQ25-1) and reported it.
—
Reply to this email directly, view it on GitHub<#1605 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AW32NTNZNGPJTQCB4E6I4D3ZRDCU5AVCNFSM6AAAAABLUILK3SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEOBUGE3TIMJSGU>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
|
I am having a similar issue, where the error is given as being related to the sample_sheet.csv, but this sample sheet has been accepted before. There is also an "Cause: Missing process or function Channel.empty([[]])" error, and then further down I've added below my .sh, .json and .nextflow.log files here I'm using: nextflow.log.txt Many thanks in advance! |
@stw8127 : I think you encountered another issue. Using an older version of NF might help: #1622 |
Thank you very much, looks like it's going now on v. 3.2.2. I'll have a read on the other thread! |
Description of the bug
I am getting an error in the samplesheet composition when running sarek pipelines. I have read many post about this kind of error when runned with different pipelines, but I didn't find anything about a possible solution for my issue.
Speciffically the error reports this message:
"The sample-sheet only contains normal-samples, but the following tools, which were requested with "--tools", expect at least one tumor-sample : mutect2"
Unfortunately, I don't understand what I missed since my samplesheet have both normal and tumor samples (also relapse tumor for some samples), as specified in the usage page.
Command used and terminal output
Relevant files
samplesheet_prova.csv
nextflow.log
System information
N E X T F L O W ~ version 24.04.3
HPC
Slurm
Singularity
nf-core/sarek v3.4.2-gb5b766d
The text was updated successfully, but these errors were encountered: