Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

returnSamples(sampleMetadata, dataType = "gene_families") crashing #304

Open
amsegura opened this issue Jun 12, 2024 · 1 comment
Open
Assignees

Comments

@amsegura
Copy link

Hello,

First I wanted to thank you for the work you have done with this package. This is my issue (connected with #302 but I did not see a solution there):

Describe the bug
I am trying to run returnSamples to obtain gene_families data from different projects. I am running the code in a cluster managed by slurm. I have done it before for the same projects but using as "dataType="relative_abundance" without any problem.

To Reproduce
This is the code I am using:

data("sampleMetadata")

df <- sampleMetadata %>% 
filter(age>= 18 & disease=='healthy' & body_site=='stool' & study_condition=='control' & !is.na(gender)) %>% 
filter(!duplicated(subject_id)) %>% 
filter(!is.na(BMI)) %>%  
filter(BMI < 30)

df_genfam <- df %>% returnSamples("gene_families", counts = TRUE)

Expected behavior
I will expect the data to download as it happened with relative_abundance data.

Error Obtained

Error: failed to load resource
  name: EH5579
  title: 2021-03-31.HMP_2012.gene_families
  reason: 1 resources failed to download
In addition: Warning messages:
1: download failed
  web resource path: '[https://experimenthub.bioconductor.org/fetch/5622](https://urldefense.com/v3/__https://experimenthub.bioconductor.org/fetch/5622__;!!D9dNQwwGXtA!X0WkCq_ax8IC9QswMQSXcFAVcVxXx9-guTcBNXi3yfHX8qPLXQAfAqfz0ZEcgsSRnxXe2N3tjNYfWHoREclLhufr21IA7CTaKLO_T2M$)'
  local file path: '/lustre/home/amartin/.cache/R/ExperimentHub/1a1245233933b3_5622'
  reason: Failed writing body (8192 != 16384) 
2: bfcadd() failed; resource removed
  rid: BFC124
  fpath: '[https://experimenthub.bioconductor.org/fetch/5622](https://urldefense.com/v3/__https://experimenthub.bioconductor.org/fetch/5622__;!!D9dNQwwGXtA!X0WkCq_ax8IC9QswMQSXcFAVcVxXx9-guTcBNXi3yfHX8qPLXQAfAqfz0ZEcgsSRnxXe2N3tjNYfWHoREclLhufr21IA7CTaKLO_T2M$)'
  reason: download failed 
3: download failed
  hub path: '[https://experimenthub.bioconductor.org/fetch/5622](https://urldefense.com/v3/__https://experimenthub.bioconductor.org/fetch/5622__;!!D9dNQwwGXtA!X0WkCq_ax8IC9QswMQSXcFAVcVxXx9-guTcBNXi3yfHX8qPLXQAfAqfz0ZEcgsSRnxXe2N3tjNYfWHoREclLhufr21IA7CTaKLO_T2M$)'
  cache resource: 'EH5579 : 5622'
  reason: bfcadd() failed; see warnings() 
Execution halted

Additional context
I am aware that this data is heavier than just relative abundance. However if I try to download just the file EH5579 I manage to do it, so I guess it is not a problem with the file being corrupted. I assigned in the cluster 425Gb RAM but I still think it crashes due to RAM memory problems.

Any idea on how can I solve this issue.

Thank you in advance for your help.

@Xueyao0830
Copy link

I also have this issue, but mine is that i cannot even download the file like EH7254, (i manange to download the file EH5579). is it the problem of RAM?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants