[SPARK-7084] improve saveAsTable documentation #5654
Can one of the admins verify this patch?
@@ -1085,6 +1085,9 @@ class DataFrame private[sql](
 * there is no notion of a persisted catalog in a standard SQL context. Instead you can write
 * an RDD out to a parquet file, and then register that file as a table. This "table" can then
 * be the target of an `insertInto`.
 *
 * Also note that this doesn't create a hive table, but instead creates a Spark data source table.
Maybe say
"Also note that while this function can persist the table metadata into Hive's metastore, the table will NOT be accessible from Hive."
Updated.
Jenkins, test this please.
Test build #30856 has started for PR 5654 at commit
Test build #30856 has finished for PR 5654 at commit
Test PASSed.
@@ -1085,6 +1085,9 @@ class DataFrame private[sql](
 * there is no notion of a persisted catalog in a standard SQL context. Instead you can write
 * an RDD out to a parquet file, and then register that file as a table. This "table" can then
 * be the target of an `insertInto`.
 *
 * Also note that while this function can persist the table metadata into Hive's metastore,
 * the table will NOT be accessible from Hive.
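The workaround this doc comment describes (write the data out as a Parquet file, then register that file as a table) might look roughly like the sketch below. It assumes the Spark 1.3-era API (`saveAsParquetFile`, `parquetFile`, `registerTempTable`); the `Person` case class, the table name `people`, and the output path are hypothetical and used only for illustration.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object ParquetTableSketch {
  // Hypothetical record type, not taken from the PR.
  case class Person(name: String, age: Int)

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("parquet-table-sketch").setMaster("local[*]")
    val sc = new SparkContext(conf)
    // A plain SQLContext: no persisted catalog, so saveAsTable is not the right tool here.
    val sqlContext = new SQLContext(sc)
    import sqlContext.implicits._

    val df = sc.parallelize(Seq(Person("alice", 30), Person("bob", 25))).toDF()

    // Hypothetical output location; saveAsParquetFile fails if the path already exists.
    val path = "/tmp/people.parquet"
    df.saveAsParquetFile(path)

    // Load the Parquet file back and register it as a temporary table.
    val people = sqlContext.parquetFile(path)
    people.registerTempTable("people")

    // The registered table can now be queried through this SQLContext.
    sqlContext.sql("SELECT name FROM people WHERE age > 26").show()

    sc.stop()
  }
}
```

The temporary table is visible only to the `SQLContext` that registered it. The Parquet files themselves are plain files on disk, so any other tool that reads Parquet can be pointed at the same path.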
There are multiple saveAsTable functions. Do you mind updating them as well?
Added to the other methods as well.
Can one of the admins verify this patch?
Jenkins, test this please.
Merged build triggered.
Merged build started.
Test build #32428 has started for PR 5654 at commit
LGTM. I will merge it after Jenkins comes back happy.
Test build #32428 has finished for PR 5654 at commit
Merged build finished. Test PASSed.
Test PASSed.
Author: madhukar <phatak.dev@gmail.com>
Closes #5654 from phatak-dev/master and squashes the following commits:
386f407 [madhukar] #5654 updated for all the methods
2c997c5 [madhukar] Merge branch 'master' of https://github.com/apache/spark
00bc819 [madhukar] Merge branch 'master' of https://github.com/apache/spark
2a802c6 [madhukar] #5654 updated the doc according to comments
866e8df [madhukar] [SPARK-7084] improve saveAsTable documentation
(cherry picked from commit 57255dc)
Signed-off-by: Reynold Xin <rxin@databricks.com>
FYI, I submitted a small patch on top of this to add a link to a JIRA ticket: #6067
A question: I used the DataFrame `saveAsTable` method in Spark SQL on Spark 1.3.1 to save a DataFrame, but reading the table from Hive fails with: Failed with exception java.io.IOException:java.io.IOException: hdfs://namenode71:8020/user/hive/warehouse/zz5/part-00000 not a SequenceFile. How can I fix this? It worked in 1.2.