Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SVD Software no longer available #25

Open
ricardoharripaul opened this issue Jan 1, 2021 · 5 comments
Open

SVD Software no longer available #25

ricardoharripaul opened this issue Jan 1, 2021 · 5 comments

Comments

@ricardoharripaul
Copy link

I was trying to run the CLAMMS software and I noticed the SVD software is not available from MIT anymore. Are there alternative methods to use or is there an alternative download link?

Thanks!

@tpjones15
Copy link

@ricardoharripaul
Copy link
Author

Thanks for the link.
The link works, however, I followed the instructions for creating the matrix for the SVD software creates an error.

$SVD -d 4 -o svd-output -r dt matrix.txt
Loading the matrix...
ERROR: svdLoadDenseTextFile: bad file format
ERROR: failed to read sparse matrix. Did you specify the correct file type with the -r argument?

Changing the format to sparse text default seemed to run but I do not know if the output is correct. The pca.coordinates.txt file seems to contain only 0 for all four columns.

My pca coordinate file seems incomplete:

$SVD -d 4 -o svd-output -r st matrix.txt
Loading the matrix...
Computing the SVD...
SOLVING THE [A^TA] EIGENPROBLEM
NO. OF ROWS = 415
NO. OF COLUMNS = 4333
NO. OF NON-ZERO VALUES = 0
MATRIX DENSITY = 0.00%
MAX. NO. OF LANCZOS STEPS = 415
MAX. NO. OF EIGENPAIRS = 4
LEFT END OF THE INTERVAL = -1.00E-30
RIGHT END OF THE INTERVAL = 1.00E-30
KAPPA = 1.00E-06

TRANSPOSING THE MATRIX FOR SPEED
NUMBER OF LANCZOS STEPS = 1
RITZ VALUES STABILIZED = 21
SINGULAR VALUES FOUND = 0

ELAPSED CPU TIME = 0 sec.
MULTIPLICATIONS BY A = 3
MULTIPLICATIONS BY A^T = 3

head pca.coordinates.txt
753 0 0 0 0
753_recaled 0 0 0 0
ANMR22-7-IAU25-2_S1 0 0 0 0
ANMR48-1_S1 0 0 0 0
ARSID-M-10-5 0 0 0 0
ARSID-M-11-3 0 0 0 0

Thanks!

@samreenzafer
Copy link

Did you figure this out? I tried "-r st" but still get the same error. My matrix file is all zeros. Is that expected?

./svd -d 4 -o svd-output -r st matrix.txt
Loading the matrix...
ERROR: svdLoadSparseTextFile: bad file format
ERROR: failed to read sparse matrix. Did you specify the correct file type with the -r argument?

@ricardoharripaul
Copy link
Author

ricardoharripaul commented Oct 18, 2021 via email

@tpjones15
Copy link

tpjones15 commented Oct 19, 2021

Hi all, firstly, apologies Ricardo, I must have missed your original reply.

This was a while ago for me too.. I believe it's an error with the example code when making the matrix file.

Where it says:
awk '$1 != "X" && $1 != "Y" && $NF == 0 { print $4 }' $FILE \

The && $NF == 0 means that the 4th field is filtered for whenever it == 0 (why this is the case, I don't know). So you just get all the 4th column 0's printed from the normalised coverage file into the matrix. I just removed this part of the line..

ls *.norm.cov.bed | while read FILE do awk '$1 != "X" && $1 != "Y" { print $4 }' $FILE \ | gawk -f $CLAMMS_DIR/transpose.gawk >>matrix.txt done
should work

Or, as the most recent issue suggests, change it to $NF != 0 (I'm not sure which would be most appropriate) - I would imagine removing it entirely, as otherwise you might get a discrepancy between the number of values from each file

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants