SPARK-4968: takeOrdered to skip reduce step in case mappers return no partitions #3830

saucam · 2014-12-29T19:20:19Z

takeOrdered should skip the reduce step when the mapped RDD has no partitions. Without that check, reduce is called on an empty collection and throws the exception below.

Query:
SELECT * FROM testTable WHERE market = 'market2' ORDER BY End_Time DESC LIMIT 100;

Error trace:
java.lang.UnsupportedOperationException: empty collection
    at org.apache.spark.rdd.RDD$$anonfun$reduce$1.apply(RDD.scala:863)
    at org.apache.spark.rdd.RDD$$anonfun$reduce$1.apply(RDD.scala:863)
    at scala.Option.getOrElse(Option.scala:120)
    at org.apache.spark.rdd.RDD.reduce(RDD.scala:863)
    at org.apache.spark.rdd.RDD.takeOrdered(RDD.scala:1136)
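
For illustration only, the idea of the guard can be sketched outside Spark in plain Python. This is not the patch itself: `take_ordered`, the list-of-lists stand-in for partitions, and the use of `heapq` and `functools.reduce` are all assumptions made for the example. The point it shows is that a reduce with no initial value fails on an empty input, so the empty case has to be handled before reducing.

```python
import heapq
from functools import reduce


def take_ordered(partitions, num):
    """Return the `num` smallest elements across all partitions, ascending.

    `partitions` is a hypothetical stand-in for an RDD: a list of lists.
    """
    if num == 0:
        return []

    # Map step: each partition contributes at most `num` of its smallest elements.
    per_partition = [heapq.nsmallest(num, part) for part in partitions]

    # Guard: with zero partitions there is nothing to reduce. Without this check,
    # functools.reduce (called with no initial value) raises TypeError on an
    # empty sequence, which mirrors the "empty collection" failure in the trace.
    if not per_partition:
        return []

    # Reduce step: merge two partial results and keep only the `num` smallest.
    merged = reduce(lambda a, b: heapq.nsmallest(num, a + b), per_partition)
    return sorted(merged)
```

Calling `take_ordered([], 100)` returns an empty list instead of raising, which is the behaviour this change is after for a query whose filter leaves no partitions.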


AmplabJenkins · 2014-12-29T19:22:09Z

Can one of the admins verify this patch?

rxin · 2014-12-29T20:08:19Z

Jenkins, test this please.

SparkQA · 2014-12-29T20:12:34Z

Test build #24867 has started for PR 3830 at commit 5974d10.

This patch merges cleanly.

SparkQA · 2014-12-29T21:36:06Z

Test build #24867 has finished for PR 3830 at commit 5974d10.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2014-12-29T21:36:10Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24867/

rxin · 2014-12-29T21:49:29Z

Merging in master & branch-1.2. Thanks.

… partitions

takeOrdered should skip reduce step in case mapped RDDs have no partitions. This prevents the mentioned exception:

4. run query
SELECT * FROM testTable WHERE market = 'market2' ORDER BY End_Time DESC LIMIT 100;
Error trace
java.lang.UnsupportedOperationException: empty collection
    at org.apache.spark.rdd.RDD$$anonfun$reduce$1.apply(RDD.scala:863)
    at org.apache.spark.rdd.RDD$$anonfun$reduce$1.apply(RDD.scala:863)
    at scala.Option.getOrElse(Option.scala:120)
    at org.apache.spark.rdd.RDD.reduce(RDD.scala:863)
    at org.apache.spark.rdd.RDD.takeOrdered(RDD.scala:1136)

Author: Yash Datta <Yash.Datta@guavus.com>

Closes #3830 from saucam/fix_takeorder and squashes the following commits:

5974d10 [Yash Datta] SPARK-4968: takeOrdered to skip reduce step in case mappers return no partitions

(cherry picked from commit 9bc0df6)
Signed-off-by: Reynold Xin <rxin@databricks.com>


asfgit closed this in 9bc0df6 Dec 29, 2014
