-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fastq format? #23
Comments
@marcucio Not currently, but it could probably be added easily - want to send a PR? |
I actually also have the variant files in the vcf format which will be easier to work with. I could write the parser for it but I need to figure out how to find rsids. Any suggestions on how to find the rsids? This is how the info is formatted:
|
We could pull in the SNP data from the NIH? ftp://ftp.ncbi.nih.gov/snp/database/organism_data/human_9606 (you can open this in chrome) |
Yea, I was looking at that. I was hoping there would be a json file somewhere where I can easily query the SNP data. Maybe that will be a separate project to write a small app that pulls from that site and makes a json file that we can use. Hopefully I can find some time in the next few days to look into that part of it further.
Thanks,
…--
Mike Marcucio
marcucio.com <http://marcucio.com/>
mike@marcucio.com <mailto:mike@marcucio.com>
blog <http://blog.getitdoneapp.com/> | twitter <https://twitter.com/GetItDoneBlog> | facebook <https://www.facebook.com/GetItDoneTasks> | google+ <https://plus.google.com/u/0/b/116698933997097774721/+Marcucio/posts>
On Mar 30, 2017, at 10:09 AM, contra ***@***.***> wrote:
We could pull in the SNP data from the NIH?
ftp://ftp.ncbi.nih.gov/snp
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub <#23 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AA4JXh_51Cm8C33VGiUFmvQORaPmCjxdks5rq7eogaJpZM4MteBk>.
|
slightly offtopic, but I also wrote a small app that pulls gene data from snpedia.com:
so we can do:
And this is what we need to do to query the genome:
Maybe I will see if I can make something simular to download and format the snp data into json |
@marcucio It would be really cool to have a repo with the files and publish it as a node module - similar to this kind of a setup: https://github.com/contra/boundaries Or if we could put it all in one JSON file and publish it, this repo could lean on it as a dependency for the parser. |
BTW - that's really cool for pulling from SNPedia. I feel like we could do something interesting there, either one module for all genes or one for each (tedious to maintain). |
I was thinking one repo for all genes/genosets, my idea was to make an app that would report on all known genes/genosets found and any known data about it. Maybe the report will even link to sites like this: https://ghr.nlm.nih.gov/gene/KCNQ2 <https://ghr.nlm.nih.gov/gene/KCNQ2> or this http://diseases.jensenlab.org/Search?query=KCNQ2 <http://diseases.jensenlab.org/Search?query=KCNQ2>
Having as a separate repo that contains all genes would make sense in my case so that I will only have to import 1 project and it will update to the latest info anytime I do "npm update”.
|
does this library support the fastq format:
https://en.wikipedia.org/wiki/FASTQ_format
The text was updated successfully, but these errors were encountered: