[SparkR][Doc] fix typo in vignettes #17884

actuaryzhang · 2017-05-06T18:40:34Z

What changes were proposed in this pull request?

Fix typo in vignettes

actuaryzhang · 2017-05-06T18:40:45Z

@felixcheung

SparkQA · 2017-05-06T19:13:11Z

Test build #76527 has finished for PR 17884 at commit 8639025.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2017-05-06T20:50:34Z

I know it is legitimate but It would be worth double checking other typos too. Usually, single typo PR is not encouraged up to my knowledge given reviwing, building and merging costs.

actuaryzhang · 2017-05-06T22:45:31Z

@HyukjinKwon Thanks for pointing this out. I will keep this in mind next time.

felixcheung · 2017-05-07T01:21:06Z

This test seems flaky on AppVeyor, not sure why

Failed -------------------------------------------------------------------------
1. Error: spark.glm and predict (@test_mllib_regression.R#57) ------------------
java.lang.IllegalStateException: SparkContext has been shutdown
	at org.apache.spark.SparkContext.runJob(SparkContext.scala:2015)
	at org.apache.spark.SparkContext.runJob(SparkContext.scala:2044)
	at org.apache.spark.SparkContext.runJob(SparkContext.scala:2063)
	at org.apache.spark.sql.execution.SparkPlan.executeTake(SparkPlan.scala:333)
	at org.apache.spark.sql.execution.CollectLimitExec.executeCollect(limit.scala:38)
	at org.apache.spark.sql.Dataset.org$apache$spark$sql$Dataset$$collectFromPlan(Dataset.scala:2923)
	at org.apache.spark.sql.Dataset$$anonfun$head$1.apply(Dataset.scala:2237)
	at org.apache.spark.sql.Dataset$$anonfun$head$1.apply(Dataset.scala:2237)
	at org.apache.spark.sql.Dataset$$anonfun$57.apply(Dataset.scala:2907)
	at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:65)
	at org.apache.spark.sql.Dataset.withAction(Dataset.scala:2906)
	at org.apache.spark.sql.Dataset.head(Dataset.scala:2237)
	at org.apache.spark.sql.Dataset.head(Dataset.scala:2244)
	at org.apache.spark.sql.Dataset.first(Dataset.scala:2251)

felixcheung · 2017-05-07T01:21:47Z

@actuaryzhang thanks - would you have a chance to run a quick QA check on the rest of the vignettes, if you haven't already?

actuaryzhang · 2017-05-07T03:22:35Z

@felixcheung I ran a quick QA on the vignettes and fixed some additional typos and styles.

SparkQA · 2017-05-07T03:53:18Z

Test build #76534 has finished for PR 17884 at commit 796a8e7.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

felixcheung

awesome, thanks! just one minor request

felixcheung · 2017-05-07T20:03:31Z

R/pkg/vignettes/sparkr-vignettes.Rmd

@@ -405,7 +405,7 @@ result <- gapply(
 head(arrange(result, "max_mpg", decreasing = TRUE))
 ```

-Like gapply, `gapplyCollect` applies a function to each partition of a `SparkDataFrame` and collect the result back to R `data.frame`. The output of the function should be a `data.frame` but no schema is required in this case. Note that `gapplyCollect` can fail if the output of UDF run on all the partition cannot be pulled to the driver and fit in driver memory.
+Like gapply, `gapplyCollect` applies a function to each partition of a `SparkDataFrame` and collect the result back to R `data.frame`. The output of the function should be a `data.frame` but no schema is required in this case. Note that `gapplyCollect` can fail if the output of the UDF on all partitions cannot be pulled into the driver's memory.


could you add backtick to gapply at the beginning

felixcheung · 2017-05-07T20:04:48Z

R/pkg/vignettes/sparkr-vignettes.Rmd

@@ -1079,19 +1079,19 @@ There are three main object classes in SparkR you may be working with.
    + `sdf` stores a reference to the corresponding Spark Dataset in the Spark JVM backend.
    + `env` saves the meta-information of the object such as `isCached`.

-It can be created by data import methods or by transforming an existing `SparkDataFrame`. We can manipulate `SparkDataFrame` by numerous data processing functions and feed that into machine learning algorithms.
+    It can be created by data import methods or by transforming an existing `SparkDataFrame`. We can manipulate `SparkDataFrame` by numerous data processing functions and feed that into machine learning algorithms.


just curious, does this whitespace in front of paragraph get handled properly?

@felixcheung Yes, the four spaces indicate that the text following should be aligned with the bullet point. Otherwise, it will start as a new paragraph and have the wrong indention.
You will see the difference after compiling the Rmarkdown file.

cool! thanks

SparkQA · 2017-05-07T23:50:43Z

Test build #76555 has finished for PR 17884 at commit b0407b5.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

## What changes were proposed in this pull request? Fix typo in vignettes Author: Wayne Zhang <actuaryzhang@uber.com> Closes #17884 from actuaryzhang/typo. (cherry picked from commit 2fdaeb5) Signed-off-by: Felix Cheung <felixcheung@apache.org>

felixcheung · 2017-05-08T06:22:25Z

merged to master/2.2
thanks!

## What changes were proposed in this pull request? Fix typo in vignettes Author: Wayne Zhang <actuaryzhang@uber.com> Closes apache#17884 from actuaryzhang/typo.

fix typo in vignettes

8639025

fix typo and style in vignettes

796a8e7

felixcheung approved these changes May 7, 2017

View reviewed changes

address comments

b0407b5

asfgit closed this in 2fdaeb5 May 8, 2017

actuaryzhang deleted the typo branch May 8, 2017 06:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SparkR][Doc] fix typo in vignettes #17884

[SparkR][Doc] fix typo in vignettes #17884

actuaryzhang commented May 6, 2017

actuaryzhang commented May 6, 2017

SparkQA commented May 6, 2017

HyukjinKwon commented May 6, 2017

actuaryzhang commented May 6, 2017

felixcheung commented May 7, 2017

felixcheung commented May 7, 2017

actuaryzhang commented May 7, 2017

SparkQA commented May 7, 2017

felixcheung left a comment

felixcheung May 7, 2017

actuaryzhang May 7, 2017

felixcheung May 7, 2017

actuaryzhang May 7, 2017 •

edited

Loading

felixcheung May 7, 2017

SparkQA commented May 7, 2017

felixcheung commented May 8, 2017

[SparkR][Doc] fix typo in vignettes #17884

[SparkR][Doc] fix typo in vignettes #17884

Conversation

actuaryzhang commented May 6, 2017

What changes were proposed in this pull request?

actuaryzhang commented May 6, 2017

SparkQA commented May 6, 2017

HyukjinKwon commented May 6, 2017

actuaryzhang commented May 6, 2017

felixcheung commented May 7, 2017

felixcheung commented May 7, 2017

actuaryzhang commented May 7, 2017

SparkQA commented May 7, 2017

felixcheung left a comment

Choose a reason for hiding this comment

felixcheung May 7, 2017

Choose a reason for hiding this comment

actuaryzhang May 7, 2017

Choose a reason for hiding this comment

felixcheung May 7, 2017

Choose a reason for hiding this comment

actuaryzhang May 7, 2017 • edited Loading

Choose a reason for hiding this comment

felixcheung May 7, 2017

Choose a reason for hiding this comment

SparkQA commented May 7, 2017

felixcheung commented May 8, 2017

actuaryzhang May 7, 2017 •

edited

Loading