SPARK-1209 [CORE] (Take 2) SparkHadoop{MapRed,MapReduce}Util should not use package org.apache.hadoop #3048

srowen · 2014-11-01T11:44:05Z

@andrewor14 Another try at SPARK-1209, to address #2814 (comment)

I successfully tested with `mvn -Dhadoop.version=1.0.4 -DskipTests clean package; mvn -Dhadoop.version=1.0.4 test`. I assume that is what failed Jenkins last time. I also tried `-Dhadoop.version=1.2.1` and `-Phadoop-2.4 -Pyarn -Phive` for more coverage.

Certain constructors that are public in later Hadoop versions are not public in Hadoop 1.0.x, so I assume this is why the class was put in `org.apache.hadoop` to begin with. One option is to leave this as-is for now and move it only when Hadoop 1.0.x support goes away.

This PR is the other option: it adds a call that forces the constructor to be accessible at run-time. It's probably less surprising than putting Spark code in `org.apache.hadoop`, but it does involve reflection. A `SecurityManager` might forbid this, but it would forbid a lot of other things Spark does too. It also seems this would only affect Hadoop 1.0.x.
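As a rough illustration of the reflection approach described above, here is a minimal, self-contained Java sketch. It is not the code from this PR: `HiddenContext` is a hypothetical stand-in for a Hadoop 1.0.x class whose constructor is not public, and `newHiddenContext` is a hypothetical helper name. Only standard `java.lang.reflect` calls are used.

```java
import java.lang.reflect.Constructor;
import java.lang.reflect.Modifier;

public class ForceAccessibleConstructor {

    // Hypothetical stand-in for a library class whose constructor is not public.
    // In the real scenario the class would live in another package.
    static final class HiddenContext {
        private final String jobId;

        private HiddenContext(String jobId) {
            this.jobId = jobId;
        }

        String jobId() {
            return jobId;
        }
    }

    // Look up the declared (non-public) constructor, force it to be
    // accessible, and invoke it. setAccessible can throw SecurityException
    // when a SecurityManager forbids suppressing access checks.
    static HiddenContext newHiddenContext(String jobId) throws ReflectiveOperationException {
        Constructor<HiddenContext> ctor =
            HiddenContext.class.getDeclaredConstructor(String.class);
        ctor.setAccessible(true);
        return ctor.newInstance(jobId);
    }

    // Reports whether the constructor is declared public (it is not here).
    static boolean constructorIsPublic() throws NoSuchMethodException {
        Constructor<HiddenContext> ctor =
            HiddenContext.class.getDeclaredConstructor(String.class);
        return Modifier.isPublic(ctor.getModifiers());
    }

    public static void main(String[] args) throws Exception {
        System.out.println("declared public: " + constructorIsPublic()); // false
        System.out.println("job id: " + newHiddenContext("job_0001").jobId()); // job_0001
    }
}
```

On a version where the constructor is already public, the `setAccessible(true)` call is simply redundant, which is consistent with the note that only Hadoop 1.0.x should be affected.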


SparkQA · 2014-11-01T11:49:55Z

Test build #22695 has started for PR 3048 at commit 0d48f4b.

This patch merges cleanly.

SparkQA · 2014-11-01T13:28:27Z

Test build #22695 has finished for PR 3048 at commit 0d48f4b.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2014-11-01T13:28:31Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22695/

andrewor14 · 2014-11-06T23:53:32Z

LGTM. @pwendell we should consider merging this into 1.2. Can you look at the changes here?

pwendell · 2014-11-10T06:10:49Z

Cool let's pull this in and see how it goes.

…ot use package org.apache.hadoop

andrewor14 Another try at SPARK-1209, to address #2814 (comment)

I successfully tested with `mvn -Dhadoop.version=1.0.4 -DskipTests clean package; mvn -Dhadoop.version=1.0.4 test`. I assume that is what failed Jenkins last time. I also tried `-Dhadoop.version=1.2.1` and `-Phadoop-2.4 -Pyarn -Phive` for more coverage.

So this is why the class was put in `org.apache.hadoop` to begin with, I assume. One option is to leave this as-is for now and move it only when Hadoop 1.0.x support goes away.

This is the other option, which adds a call to force the constructor to be public at run-time. It's probably less surprising than putting Spark code in `org.apache.hadoop`, but it does involve reflection. A `SecurityManager` might forbid this, but it would forbid a lot of stuff Spark does. This would also only affect Hadoop 1.0.x, it seems.

Author: Sean Owen <sowen@cloudera.com>

Closes #3048 from srowen/SPARK-1209 and squashes the following commits:

0d48f4b [Sean Owen] For Hadoop 1.0.x, make certain constructors public, which were public in later versions
466e179 [Sean Owen] Disable MIMA warnings resulting from moving the class -- this was also part of the PairRDDFunctions type hierarchy though?
eb61820 [Sean Owen] Move SparkHadoopMapRedUtil / SparkHadoopMapReduceUtil from org.apache.hadoop to org.apache.spark

(cherry picked from commit f8e5732)
Signed-off-by: Patrick Wendell <pwendell@gmail.com>

srowen added 3 commits November 1, 2014 09:39

Move SparkHadoopMapRedUtil / SparkHadoopMapReduceUtil from org.apache.hadoop to org.apache.spark
eb61820

Disable MIMA warnings resulting from moving the class -- this was also part of the PairRDDFunctions type hierarchy though?
466e179

For Hadoop 1.0.x, make certain constructors public, which were public in later versions
0d48f4b

asfgit closed this in f8e5732 Nov 10, 2014

srowen deleted the SPARK-1209 branch November 11, 2014 15:12
