Example of running SQL on Kudu tables via Spark SQL

CattMarlin/spark-sql-kudu

Spark SQL Kudu Example

Template to run SQL on Kudu tables and save to Hive

This example provides a YAML template and code base for running SQL on Kudu tables with Spark. By default the YAML file is read from HDFS, but the job can be configured to read it from the edge node instead.
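
As a rough illustration of the two config locations, the sketch below reads the YAML text either from HDFS (by shelling out to `hdfs dfs -cat`) or from a local path on the edge node. The function name `read_config_text` and the rule of switching on an `hdfs://` prefix are assumptions made for this example; they are not taken from this repository's code.

```python
import subprocess
from pathlib import Path


def read_config_text(location: str) -> str:
    """Return the raw YAML text from HDFS or from a local (edge node) path."""
    if location.startswith("hdfs://"):
        # Assumes the `hdfs` CLI is available on PATH.
        result = subprocess.run(
            ["hdfs", "dfs", "-cat", location],
            check=True,
            capture_output=True,
            text=True,
        )
        return result.stdout
    # Anything else is treated as a file on the local filesystem.
    return Path(location).read_text(encoding="utf-8")
```

The returned text could then be handed to a YAML parser of your choice.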

How to use

  • Edit queries.yml to include the queries you want to run, and list the Kudu tables they use. Queries can also reference Hive tables.
  • Edit the database location, then run the job with run-spark-sql-kudu.sh.
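
The flow described above (list the Kudu tables, run each query, save the result to Hive) might look roughly like the following PySpark sketch. The config keys `kudu_tables`, `queries`, `sql`, and `target`, the table names, and the helper functions are all hypothetical; the actual queries.yml template and job code in this repository may be organized differently.

```python
# Hypothetical shape of queries.yml after parsing (for example with PyYAML's
# yaml.safe_load). The real template may use different keys.
EXAMPLE_CONFIG = {
    "kudu_tables": ["impala::default.orders"],
    "queries": [
        {
            "sql": "SELECT customer_id, COUNT(*) AS n FROM orders GROUP BY customer_id",
            "target": "analytics.order_counts",
        }
    ],
}


def view_name(kudu_table: str) -> str:
    """Derive a temp-view name, e.g. 'impala::default.orders' -> 'orders'."""
    return kudu_table.split("::")[-1].split(".")[-1]


def run_queries(spark, config: dict, kudu_master: str) -> list:
    """Register each Kudu table as a temp view, then save each query result to Hive."""
    for table in config["kudu_tables"]:
        df = (
            # kudu-spark data source; older releases may need the long name
            # "org.apache.kudu.spark.kudu" instead of "kudu".
            spark.read.format("kudu")
            .option("kudu.master", kudu_master)
            .option("kudu.table", table)
            .load()
        )
        df.createOrReplaceTempView(view_name(table))

    written = []
    for query in config["queries"]:
        # Overwrite mode is an assumption; pick the save mode that fits your job.
        spark.sql(query["sql"]).write.mode("overwrite").saveAsTable(query["target"])
        written.append(query["target"])
    return written
```

Here `spark` is expected to be a `SparkSession` created with Hive support enabled and with the kudu-spark connector on the classpath, and `kudu_master` is the address of your Kudu master.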
