-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error: Cannot open input sequence files. output/clean.main.unmapped_1.fastq output/clean.main.unmapped_2.fastq #24
Comments
Hi, Please check if the path of And from the error messages, there are two more issues need to be noticed:
Thanks for reporting, |
Thanks for your method.
But still have some errors.
|
NCLscan.config
|
Hi,
Please use "gencode.v34.pc_transcripts.fa" instead. tw |
Hi tw, thanks, |
Hi tw,
Could you please share some protocol of annotation of circRNA?
I check there is a package “ circRNAprofiler: An R-Based Computational Framework for the Downstream Analysis of Circular RNAs”, but for NCLscan is only for hg19, but my work is base on hg38.
Best,
Jian-Guo
… 在 2020年8月12日,04:36,Tai-Wei Chiang ***@***.***> 写道:
Hi,
...
## The protein-coding transcript sequences, eg. /path/to/gencode.v19.pc_transcripts.fa
Protein_coding_transcripts = /home/rstudio/hdd/reference/hg38_ek12/gencode.v34.pc_translations.fa
...
Please use "gencode.v34.pc_transcripts.fa" instead.
tw
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub <#24 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AE2DUKPGSOANACRAVUKT663SAH5ZJANCNFSM4PZMMBVQ>.
|
Ah... If you need coordinates converting, you can use But if it is about circRNAprofiler, maybe you should just open an issue on their GitHub page, |
Thanks, i will contact circRNAprofiler. |
I used NCLscan
After run,
But, I don't know keep which docunments and input to circRNAprofiler. |
Hi tw,
for NCLscan.config
However, my CPU only used 20% when running novoalign... |
Hi,
Did you have your novoalign license under the same path with the Thanks, |
Hi tw, |
Hi,
You're right! There is no "non-profit" mode for novoalign V4, it must be run with a license. But usually, And, to be honest, I don't have any ideas about the issue right now. Best, |
I mean I have the license if novoalign v4, but you suggest used novoalign v3. |
I mean I have the license if novoalign v4, but you suggest used novoalign v3.
As Colin Hercus, whom works at Novocraft said that "That's excellent. V4 should also work with the license file and is 3 to 4 times faster".
But, you recommend is use v3.
So, v4 is works for NCLscan?
Best
Jian-Guo Zhou MD
Tai-Wei Chiang <notifications@github.com> 于 2020年8月18日星期二 GMT+2 下午01:25:00写道:
Hi,
Hi,
However, my CPU only used 20% when running novoalign...
Did you have your novoalign license under the same path with the novoalign program?
Thanks,
tw
Hi tw,
I already have the license of novoalign. And I can use novoalign v4.
Best,
You're right! There is no "non-profit" mode for novoalign V4, it must be run with a license.
But usually, novoalign should use as many CPU as possible for its mapping job.
And, to be honest, I don't have any ideas about the issue right now.
Best,
tw
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
Hi, So, for now, we still suggest the users to use novoalign V3 with NCLscan. Please refer to issue #21 . Thanks, |
For sure, I understand. |
cd ~/biosoft
cd NCLscan
./NCLscan.py -c ./NCLscan.config -pj clean -o output --fq1 EGAR00001653004_1.fastq.gz --fq2 EGAR00001653004_2.fastq.gz
Unknown command "view".
[M::bwa_idx_load_from_disk] read 0 ALT contigs
[E::main_mem] fail to open file `EGAR00001653004_1.fastq.gz'.
Unknown command "view".
gzip: EGAR00001653004_1.fastq.gz: No such file or directory
gzip: EGAR00001653004_2.fastq.gz: No such file or directory
Sequence file is empty: output/clean.main.bwa.unmapped_1.fastq
Error: Cannot open input sequence files. output/clean.main.bwa.unmapped_1.fastq output/clean.main.bwa.unmapped_2.fastq
Unknown command "view".
End of file reading 4 bytes
Total time cost = 0.00259613990784 sec
PslChimeraFilter v0.4
End of file reading 4 bytes
Total time cost = 0.00266909599304 sec
End of file reading 4 bytes
Total time cost = 0.00257301330566 sec
End of file reading 4 bytes
Total time cost = 0.0025110244751 sec
JunctionSite2BED v0.3
Reading annotations on chr1.
Reading annotations on chr2.
Reading annotations on chr3.
Reading annotations on chr4.
Reading annotations on chr5.
Reading annotations on chr6.
Reading annotations on chr7.
Reading annotations on chr8.
Reading annotations on chr9.
Reading annotations on chr10.
Reading annotations on chr11.
Reading annotations on chr12.
Reading annotations on chr13.
Reading annotations on chr14.
Reading annotations on chr15.
Reading annotations on chr16.
Reading annotations on chr17.
Reading annotations on chr18.
Reading annotations on chr19.
Reading annotations on chr20.
Reading annotations on chr21.
Reading annotations on chr22.
Reading annotations on chrX.
Reading annotations on chrY.
Reading annotations on chrM.
Read 60669 genes, 228048 transcripts and 1378888 exons from the gtf file.
novoindex (4.2) - Universal k-mer index constructor.
(C) 2008 - 2011 NovoCraft Technologies Sdn Bhd
novoindex output/clean.JS.ndx output/clean.JS.fa
Creating 23 indexing threads.
Building with 9-mer and step of 1 bp.
novoindex construction dT = 0.0s
Index memory size 0.001Gbyte.
Done.
Sequence file is empty: output/clean.main.unmapped_1.fastq
Error: Cannot open input sequence files. output/clean.main.unmapped_1.fastq
Sequence file is empty: output/clean.main.unmapped_2.fastq
Error: Cannot open input sequence files. output/clean.main.unmapped_2.fastq
End of file reading 4 bytes
Total time cost = 0.00268507003784 sec
End of file reading 4 bytes
Total time cost = 0.00269103050232 sec
PslChimeraFilter v0.4
JunctionSite2BED v0.3
Reading annotations on chr1.
Reading annotations on chr2.
Reading annotations on chr3.
Reading annotations on chr4.
Reading annotations on chr5.
Reading annotations on chr6.
Reading annotations on chr7.
Reading annotations on chr8.
Reading annotations on chr9.
Reading annotations on chr10.
Reading annotations on chr11.
Reading annotations on chr12.
Reading annotations on chr13.
Reading annotations on chr14.
Reading annotations on chr15.
Reading annotations on chr16.
Reading annotations on chr17.
Reading annotations on chr18.
Reading annotations on chr19.
Reading annotations on chr20.
Reading annotations on chr21.
Reading annotations on chr22.
Reading annotations on chrX.
Reading annotations on chrY.
Reading annotations on chrM.
Read 60669 genes, 228048 transcripts and 1378888 exons from the gtf file.
novoindex (4.2) - Universal k-mer index constructor.
(C) 2008 - 2011 NovoCraft Technologies Sdn Bhd
novoindex output/clean.JS2.ndx output/clean.JS2.fa
Creating 23 indexing threads.
Building with 9-mer and step of 1 bp.
novoindex construction dT = 0.0s
Index memory size 0.001Gbyte.
Done.
Sequence file is empty: output/clean.main.unmapped_1.fastq
Error: Cannot open input sequence files. output/clean.main.unmapped_1.fastq output/clean.main.unmapped_2.fastq
End of file reading 4 bytes
Total time cost = 0.00274515151978 sec
End of file reading 4 bytes
Total time cost = 0.00262308120728 sec
End of file reading 4 bytes
Total time cost = 0.00268697738647 sec
End of file reading 4 bytes
Total time cost = 0.00296902656555 sec
Traceback (most recent call last):
File "/home/rstudio/biosoft/NCLscan/bin/Add_read_count.py", line 118, in
add_read_count(args.result_tmp_file, args.result_sam_file, args.output, args.JSParser_bin)
File "/home/rstudio/biosoft/NCLscan/bin/Add_read_count.py", line 13, in add_read_count
all_junc_read_with_ref = get_junc_read(result_sam_data, JSParser_bin)
File "/home/rstudio/biosoft/NCLscan/bin/Add_read_count.py", line 66, in get_junc_read
junc_read_data = get_read_with_ref(junc_read_sam_data)
File "/home/rstudio/biosoft/NCLscan/bin/Add_read_count.py", line 46, in get_read_with_ref
ref_id = re.sub(".[0-9]*$", "", line[2])
IndexError: list index out of range
Traceback (most recent call last):
File "/home/rstudio/biosoft/NCLscan/bin/get_gene_name.py", line 91, in
add_gene_name(args.result_tmp_file, args.gene_anno, args.output)
File "/home/rstudio/biosoft/NCLscan/bin/get_gene_name.py", line 8, in add_gene_name
result_tmp_data = read_TSV(result_tmp_file)
File "/home/rstudio/biosoft/NCLscan/bin/get_gene_name.py", line 64, in read_TSV
with open(tsv_file) as data_reader:
IOError: [Errno 2] No such file or directory: 'output/clean.result.tmp2'
Traceback (most recent call last):
File "./NCLscan.py", line 448, in
NCL_Scan4(config, datasets_list, args.project_name, args.output_dir)
File "./NCLscan.py", line 255, in NCL_Scan4
final_tmp = read_TSV("{prefix}.result.tmp3".format(**config_options))
File "./NCLscan.py", line 279, in read_TSV
with open(tsv_file) as data_reader:
IOError: [Errno 2] No such file or directory: 'output/clean.result.tmp3'
Please help me check those errors.
Best,
The text was updated successfully, but these errors were encountered: