Changes to RDD utility files for new variant schema #114

nealsid · 2014-02-18T19:47:01Z

After this I will work on cleaning the build.

…t, ADAMVariantContextRDDFunctions, FieldEnumerationSuite

fnothaft · 2014-02-18T20:00:20Z

adam-core/src/main/scala/edu/berkeley/cs/amplab/adam/rdd/AdamRDDFunctions.scala

@@ -151,15 +145,14 @@ class AdamRecordRDDFunctions(rdd: RDD[ADAMRecord]) extends Serializable with Log
     * @param r Read to map.
     * @return List containing one or two mapping key/value pairs.
     */
-    def mapToBucket (r: ADAMRecord): List[(ReferencePosition, ADAMRecord)] = {
+    def mapToBucket (r: ADAMRecord): List[(Long, ADAMRecord)] = {


Are these changes intended? It looks like they revert to older code?

Agreed; was that intentional? If not, this is the other cause of the build failure.

Fixed, thanks!

AmplabJenkins · 2014-02-18T20:04:16Z

One or more automated tests failed
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/ADAM-prb/130/

fnothaft · 2014-02-18T20:44:58Z

adam-core/src/main/scala/edu/berkeley/cs/amplab/adam/util/ParquetFileTraversable.scala

@@ -31,7 +31,7 @@ class ParquetFileTraversable[T <: IndexedRecord](sc: SparkContext, file: Path) e
    }
    val status = fs.getFileStatus(file)
    var paths = List[Path]()
-    if (status.isDir) {
+    if (status.isDirectory) {


Just as a heads up, this is not reverse compatible with Hadoop 1.

This should probably be discussed in some detail.

At the least, this change should also add some text into the POM saying that only 2.2 is supported (or equivalent).

For us here, this won't affect us. However, I feel like we could get by with waiting until 0.7.0 for this change, or possibly even a 0.6.2 release.

It won't affect us here at Sinai, either. Do we have anyone using Hadoop 1.x?

I'd vote for removing the comment about pre-2.2 from the POM instead.

We've been running Hadoop 1.0.4. I'll see if we can move to Hadoop 2 for all our work.

Nevermind, we're fine with Hadoop 2. Let's just make the move then.

Wait. Why would be break Hadoop 1 compatibility if we can easily avoid it?

Can we just us "status.isDir" here and then open a discussion with the broader group on the mailing list? If everyone is ok with dropping Hadoop 1.x support, that's fine. We shouldn't decide that here.

I reverted this change.

fnothaft · 2014-02-18T20:45:33Z

Looks good to me @nealsid !

fnothaft · 2014-02-18T22:09:21Z

Do you want this to merge in now, or want to wait for more reviews?

tdanford · 2014-02-18T22:19:56Z

Wait a few more hours please...

On Tue, Feb 18, 2014 at 5:09 PM, Frank Austin Nothaft
notifications@github.com wrote:

Do you want this to merge in now, or want to wait for more reviews?

Reply to this email directly or view it on GitHub:
#114 (comment)

carlyeks · 2014-02-18T22:25:58Z

adam-core/src/main/scala/edu/berkeley/cs/amplab/adam/rdd/variation/ADAMVariationContext.scala

@@ -0,0 +1,56 @@
+/*
+ * Copyright (c) 2013. Mount Sinai School of Medicine


This should probably be updated to 2014

AmplabJenkins · 2014-02-19T01:04:28Z

One or more automated tests failed
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/ADAM-prb/134/

carlyeks · 2014-02-20T00:54:57Z

@nealsid: Is this ready to be merged, or did you want to address your comment first?

AmplabJenkins · 2014-02-20T01:11:38Z

One or more automated tests failed
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/ADAM-prb/142/

nealsid · 2014-02-20T03:45:39Z

It's ready to merge, sorry, I should have clarified that.

Changes to RDD utility files for new variant schema

fnothaft · 2014-02-20T04:21:47Z

Thanks @nealsid! Merged.

Neal Sidhwaney added 3 commits February 18, 2014 12:09

RDD functions for new variant schema

2375b74

New test suites for RichADAMVariant, AdamContext, AdamVariationContex…

137253e

…t, ADAMVariantContextRDDFunctions, FieldEnumerationSuite

Remove out-of-date reference to CompareAdam

6abf6b1

fnothaft reviewed Feb 18, 2014
View reviewed changes

carlyeks reviewed Feb 18, 2014
View reviewed changes

Code review changes

7e0ed42

Fix tests & build of adam-core

3d3d66e

fnothaft added a commit that referenced this pull request Feb 20, 2014

Merge pull request #114 from nealsid/vcf-work-rdd

37bb148

Changes to RDD utility files for new variant schema

fnothaft merged commit 37bb148 into bigdatagenomics:vcf-work Feb 20, 2014

nealsid deleted the vcf-work-rdd branch February 20, 2014 15:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes to RDD utility files for new variant schema #114

Changes to RDD utility files for new variant schema #114

nealsid commented Feb 18, 2014

fnothaft Feb 18, 2014

carlyeks Feb 18, 2014

nealsid Feb 20, 2014

AmplabJenkins commented Feb 18, 2014

fnothaft Feb 18, 2014

carlyeks Feb 18, 2014

hammer Feb 18, 2014

carlyeks Feb 18, 2014

fnothaft Feb 18, 2014

fnothaft Feb 18, 2014

massie Feb 18, 2014

nealsid Feb 20, 2014

fnothaft commented Feb 18, 2014

fnothaft commented Feb 18, 2014

tdanford commented Feb 18, 2014

Do you want this to merge in now, or want to wait for more reviews?

carlyeks Feb 18, 2014

nealsid Feb 20, 2014

AmplabJenkins commented Feb 19, 2014

carlyeks commented Feb 20, 2014

AmplabJenkins commented Feb 20, 2014

nealsid commented Feb 20, 2014

fnothaft commented Feb 20, 2014

		@@ -0,0 +1,56 @@
		/*
		* Copyright (c) 2013. Mount Sinai School of Medicine

Changes to RDD utility files for new variant schema #114

Changes to RDD utility files for new variant schema #114

Conversation

nealsid commented Feb 18, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AmplabJenkins commented Feb 18, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fnothaft commented Feb 18, 2014

fnothaft commented Feb 18, 2014

tdanford commented Feb 18, 2014

Do you want this to merge in now, or want to wait for more reviews?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AmplabJenkins commented Feb 19, 2014

carlyeks commented Feb 20, 2014

AmplabJenkins commented Feb 20, 2014

nealsid commented Feb 20, 2014

fnothaft commented Feb 20, 2014