[ADAM-783] Write @SQ header lines in sorted order. #784
Test FAILed. Build result: FAILURE
GitHub pull request #784 of commit 2b3ed5e automatically merged.
Notifying endpoint 'HTTP:https://webhooks.gitter.im/e/ac8bb6e9f53357bc8aa8'
[EnvInject] - Loading node environment variables.
Building remotely on amp-jenkins-worker-05 (centos spark-test) in workspace /home/jenkins/workspace/ADAM-prb
 > git rev-parse --is-inside-work-tree # timeout=10
Fetching changes from the remote Git repository
 > git config remote.origin.url https://github.com/bigdatagenomics/adam.git # timeout=10
Fetching upstream changes from https://github.com/bigdatagenomics/adam.git
 > git --version # timeout=10
 > git fetch --tags --progress https://github.com/bigdatagenomics/adam.git +refs/pull/:refs/remotes/origin/pr/
 > git rev-parse origin/pr/784/merge^{commit} # timeout=10
 > git branch -a --contains 53d70081fdfe9797c24be895796e68d8f567ec80 # timeout=10
 > git rev-parse remotes/origin/pr/784/merge^{commit} # timeout=10
Checking out Revision 53d70081fdfe9797c24be895796e68d8f567ec80 (origin/pr/784/merge)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f 53d70081fdfe9797c24be895796e68d8f567ec80
First time build. Skipping changelog.
Triggering ADAM-prb ? 2.6.0,2.10,1.4.1,centos
Triggering ADAM-prb ? 2.6.0,2.11,1.4.1,centos
Touchstone configurations resulted in FAILURE, so aborting...
Notifying endpoint 'HTTP:https://webhooks.gitter.im/e/ac8bb6e9f53357bc8aa8'
Test FAILed. |
lgtm from a quick pass |
Whoops! Forgot to add the test collateral... |
Test PASSed. |
Rebased. |
Test PASSed. |
Rerebased. |
Test PASSed. |
This now resolves #760 as well. Can I get review/merge? |
Test PASSed. |
Ping on review/merge... This is the last issue pending for the 0.17.1 release. |
looking |
Thanks @ryan-williams! |
This lgtm; one question: this optionally sorts the header lines to match the sort of the reads that can be optionally done with That seems reasonable, but feels a little backwards, since the spec says that the If ADAM will only ever output a lex-sort of the |
Finally, I don't know what the preferred way of merging is these days :) |
Thanks for the review! @ryan-williams for now, we're just emitting coordinate sorted order. The other orders are defined here and are I agree it is a bit backwards, but I think an equivalent way to read it is that you need to have the same lexicographic order for both the reads and the header. We can continue using the merge button for now! After I've got scripts ready, we can cut over. |
FWIW,
My point, here and in #794, is that the SAM spec doesn't actually say that Arguably we should be able to leave Anyway, unless this sounds like such a good idea to you that you want to do it here, I'm fine to merge this and then address the possibility of |
This change resolves bigdatagenomics#783. Specifically, now we write the SAM/BAM @SQ header lines using the same lexicographic ordering that we use for sorting records.
Sets the "@HD" header line sort order to "coordinate" when saving a sorted file in BAM.
Rebased. |
k, i'll merge when test passes |
@ryan-williams doesn't that snippet just specify what the I get what you're saying. I am personally OK with rewriting the |
ah, you mean that they don't explicitly say what Yea, we can discuss further on #794; always sorting |
Test PASSed. |
[ADAM-783] Write @SQ header lines in sorted order.
This change resolves #783 and #760. Specifically, now we write the SAM/BAM @SQ header
lines using the same lexicographic ordering that we use for sorting records, and we write the @HD line to note that we are sorted in coordinate order.
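
For readers skimming the thread, here is a rough Python sketch of the behavior the commit message describes. It is not ADAM's Scala implementation: the function name `sam_header_lines`, the `(name, length)` tuple representation, and the `VN:1.4` version tag are all assumptions made for illustration. It only shows the idea of emitting @SQ lines in lexicographic contig-name order and marking the @HD line as coordinate sorted when the output is sorted.

```python
from typing import List, Tuple


def sam_header_lines(contigs: List[Tuple[str, int]], sorted_output: bool) -> List[str]:
    """Build @HD and @SQ header lines for hypothetical (name, length) contigs."""
    if sorted_output:
        # Order contigs lexicographically by name, mirroring the ordering
        # the commit message says is used when sorting records.
        contigs = sorted(contigs, key=lambda contig: contig[0])
        sort_order = "coordinate"
    else:
        # Leave the contigs in their original order and say so in @HD.
        sort_order = "unsorted"

    # VN:1.4 is an assumed format version, chosen only for the example.
    hd_line = f"@HD\tVN:1.4\tSO:{sort_order}"
    sq_lines = [f"@SQ\tSN:{name}\tLN:{length}" for name, length in contigs]
    return [hd_line] + sq_lines


if __name__ == "__main__":
    example = [("chr2", 100), ("chr10", 50), ("chr1", 200)]
    for line in sam_header_lines(example, sorted_output=True):
        print(line)
```

Note that a lexicographic sort places `chr10` before `chr2`, which is the "lex-sort" behavior discussed in the review comments above.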