Not able to use Z-Order spark extension #2612
-
Hi Kyuubi Community, I'm trying to use Kyuubi's Z-Order extension alone to optimize the hive table in S3, but I met with this issue.
When I run the script, the error occurs: Kyuubi: 1.5.1 I have struggled for quite a while but cannot find any clue, any help will be much appreciated! |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 3 replies
-
hi @Dearkano look at the error message, it seems the kyuubi extension does not work, please make sure the config Another thing you may need to know is, for now Kyuubi only support optimize Hive table. That said, we do not support the optimize Here is the Z-Order docs. |
Beta Was this translation helpful? Give feedback.
-
Hi @ulysses-you , thank you for your reply!! I also confirmed that jar is in the right place since if I removed that jar, and ran the script again, the following error would occur:
Also, for the case, you said that Kyuubi only supports hive table, I'm using hive on S3 with Glue as the metastore and queried by Spark/Trino (no hive server or thrift service), my intention is to take advantage of Kyuubi's Z-Order optimize on parquet files instead of using the whole Kyuubi. I'm not sure if this is supported by Kyuubi? Thank you again! |
Beta Was this translation helpful? Give feedback.
-
Hi @ulysses-you thank you for investigating! |
Beta Was this translation helpful? Give feedback.
hi @Dearkano look at the error message, it seems the kyuubi extension does not work, please make sure the config
spark.sql.extensions=org.apache.kyuubi.sql.KyuubiSparkSQLExtension
actually apply. In case your spark session is fromget
rather thancreate
, you can add the config intoconf/spark-defaults.conf
and restart the pyspark application.Another thing you may need to know is, for now Kyuubi only support optimize Hive table. That said, we do not support the optimize
sqlOnFiles
which is your case.Here is the Z-Order docs.