[SPARK-7084] improve saveAsTable documentation #5654

phatak-dev · 2015-04-23T08:13:16Z

No description provided.

AmplabJenkins · 2015-04-23T08:17:11Z

Can one of the admins verify this patch?

rxin · 2015-04-23T08:45:21Z

sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala

@@ -1085,6 +1085,9 @@ class DataFrame private[sql](
   * there is no notion of a persisted catalog in a standard SQL context.  Instead you can write
   * an RDD out to a parquet file, and then register that file as a table.  This "table" can then
   * be the target of an `insertInto`.
+   *
+   * Also note that this doesn't create a hive table, but instead creates a Spark data source table.


Maybe say

"Also note that while this function can persist the table metadata into Hive's metastore, the table will NOT be accessible from Hive."

phatak-dev · 2015-04-23T10:05:00Z

Updated.

rxin · 2015-04-23T17:32:46Z

Jenkins, test this please.

SparkQA · 2015-04-23T17:37:50Z

Test build #30856 has started for PR 5654 at commit 00bc819.

SparkQA · 2015-04-23T19:13:57Z

Test build #30856 has finished for PR 5654 at commit 00bc819.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- class FreqItemset(namedtuple("FreqItemset", ["items", "freq"])):
- protected[sql] abstract class AtomicType extends DataType
- abstract class NumericType extends AtomicType
- class Encoder[T <: AtomicType](columnType: NativeColumnType[T]) extends compression.Encoder[T]
- class Decoder[T <: AtomicType](buffer: ByteBuffer, columnType: NativeColumnType[T])
- class Encoder[T <: AtomicType](columnType: NativeColumnType[T]) extends compression.Encoder[T]
- class Decoder[T <: AtomicType](buffer: ByteBuffer, columnType: NativeColumnType[T])
- class Encoder[T <: AtomicType](columnType: NativeColumnType[T]) extends compression.Encoder[T]
- class Decoder[T <: AtomicType](buffer: ByteBuffer, columnType: NativeColumnType[T])
This patch does not change any dependencies.

AmplabJenkins · 2015-04-23T19:14:02Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30856/
Test PASSed.

rxin · 2015-04-23T19:17:19Z

sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala

@@ -1085,6 +1085,9 @@ class DataFrame private[sql](
   * there is no notion of a persisted catalog in a standard SQL context.  Instead you can write
   * an RDD out to a parquet file, and then register that file as a table.  This "table" can then
   * be the target of an `insertInto`.
+   *
+   * Also note that while this function can persist the table metadata into Hive's metastore,
+   * the table will NOT be accessible from Hive.


There are multiple saveAsTable functions. Do you mind updating them as well?

phatak-dev · 2015-04-24T04:28:56Z

Added for other methods also.

AmplabJenkins · 2015-04-27T18:18:02Z

Can one of the admins verify this patch?

rxin · 2015-05-11T21:56:56Z

Jenkins, test this please.

AmplabJenkins · 2015-05-11T21:57:13Z

Merged build triggered.

AmplabJenkins · 2015-05-11T21:57:20Z

Merged build started.

SparkQA · 2015-05-11T21:59:14Z

Test build #32428 has started for PR 5654 at commit 386f407.

rxin · 2015-05-11T22:04:28Z

LGTM. I will merge it after Jenkins comes back happy.

SparkQA · 2015-05-11T23:59:46Z

Test build #32428 has finished for PR 5654 at commit 386f407.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

AmplabJenkins · 2015-05-11T23:59:51Z

Merged build finished. Test PASSed.

AmplabJenkins · 2015-05-11T23:59:51Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/32428/
Test PASSed.

Author: madhukar <phatak.dev@gmail.com> Closes #5654 from phatak-dev/master and squashes the following commits: 386f407 [madhukar] #5654 updated for all the methods 2c997c5 [madhukar] Merge branch 'master' of https://github.com/apache/spark 00bc819 [madhukar] Merge branch 'master' of https://github.com/apache/spark 2a802c6 [madhukar] #5654 updated the doc according to comments 866e8df [madhukar] [SPARK-7084] improve saveAsTable documentation (cherry picked from commit 57255dc) Signed-off-by: Reynold Xin <rxin@databricks.com>

rxin · 2015-05-12T00:11:26Z

FYI I submitted a small patch on top of this to add a link to a jira ticket: #6067

Author: madhukar <phatak.dev@gmail.com> Closes apache#5654 from phatak-dev/master and squashes the following commits: 386f407 [madhukar] apache#5654 updated for all the methods 2c997c5 [madhukar] Merge branch 'master' of https://github.com/apache/spark 00bc819 [madhukar] Merge branch 'master' of https://github.com/apache/spark 2a802c6 [madhukar] apache#5654 updated the doc according to comments 866e8df [madhukar] [SPARK-7084] improve saveAsTable documentation

chiyingyunhua · 2015-06-18T02:01:58Z

请问，我使用spark1.3.1的sparksql中dataframe的一个saveastable方法存了一个dataframe，但是用hive读的时候出错了Failed with exception java.io.IOException:java.io.IOException: hdfs://namenode71:8020/user/hive/warehouse/zz5/part-00000 not a SequenceFile。这个怎么破？1.2是可以的

Author: madhukar <phatak.dev@gmail.com> Closes apache#5654 from phatak-dev/master and squashes the following commits: 386f407 [madhukar] apache#5654 updated for all the methods 2c997c5 [madhukar] Merge branch 'master' of https://github.com/apache/spark 00bc819 [madhukar] Merge branch 'master' of https://github.com/apache/spark 2a802c6 [madhukar] apache#5654 updated the doc according to comments 866e8df [madhukar] [SPARK-7084] improve saveAsTable documentation

[SPARK-7084] improve saveAsTable documentation

866e8df

rxin reviewed Apr 23, 2015
View reviewed changes

phatak-dev added 2 commits April 23, 2015 15:33

#5654 updated the doc according to comments

2a802c6

Merge branch 'master' of https://github.com/apache/spark

00bc819

rxin reviewed Apr 23, 2015
View reviewed changes

phatak-dev added 2 commits April 24, 2015 09:51

Merge branch 'master' of https://github.com/apache/spark

2c997c5

#5654 updated for all the methods

386f407

asfgit closed this in 57255dc May 12, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-7084] improve saveAsTable documentation #5654

[SPARK-7084] improve saveAsTable documentation #5654

phatak-dev commented Apr 23, 2015

AmplabJenkins commented Apr 23, 2015

rxin Apr 23, 2015

phatak-dev commented Apr 23, 2015

rxin commented Apr 23, 2015

SparkQA commented Apr 23, 2015

SparkQA commented Apr 23, 2015

AmplabJenkins commented Apr 23, 2015

rxin Apr 23, 2015

phatak-dev commented Apr 24, 2015

AmplabJenkins commented Apr 27, 2015

rxin commented May 11, 2015

AmplabJenkins commented May 11, 2015

AmplabJenkins commented May 11, 2015

SparkQA commented May 11, 2015

rxin commented May 11, 2015

SparkQA commented May 11, 2015

AmplabJenkins commented May 11, 2015

AmplabJenkins commented May 11, 2015

rxin commented May 12, 2015

chiyingyunhua commented Jun 18, 2015

[SPARK-7084] improve saveAsTable documentation #5654

[SPARK-7084] improve saveAsTable documentation #5654

Conversation

phatak-dev commented Apr 23, 2015

AmplabJenkins commented Apr 23, 2015

rxin Apr 23, 2015

Choose a reason for hiding this comment

phatak-dev commented Apr 23, 2015

rxin commented Apr 23, 2015

SparkQA commented Apr 23, 2015

SparkQA commented Apr 23, 2015

AmplabJenkins commented Apr 23, 2015

rxin Apr 23, 2015

Choose a reason for hiding this comment

phatak-dev commented Apr 24, 2015

AmplabJenkins commented Apr 27, 2015

rxin commented May 11, 2015

AmplabJenkins commented May 11, 2015

AmplabJenkins commented May 11, 2015

SparkQA commented May 11, 2015

rxin commented May 11, 2015

SparkQA commented May 11, 2015

AmplabJenkins commented May 11, 2015

AmplabJenkins commented May 11, 2015

rxin commented May 12, 2015

chiyingyunhua commented Jun 18, 2015