-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Support for Spark Connect (aka Delta Connect) #3240
Labels
enhancement
New feature or request
Milestone
Comments
This was referenced Jun 7, 2024
3 tasks
@allisonport-db There are A couple of issues with the above commands for starting the connect server.
|
|
Can't wait for this! ❤️ |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Feature request
Which Delta project/connector is this regarding?
Overview
Spark Connect is a new initiative in Apache Spark that adds a decoupled client-server infrastructure which allows Spark applications to connect remotely to a Spark server and run SQL / Dataframe operations. We want to develop what we're calling "Delta Connect" to allow Delta operations to be made in applications running in such client-server mode.
Further details
These are the CUJs we would like to support:
Server
The server is packaged into the
io.delta:delta-spark-connect-server_2.13
package, installing this package automatically installs theio.delta:delta-spark-connect-common_2.13
package.Scala Client
The client is packaged into the
io.delta:delta-spark-connect-client_2.13
package, installing this package automatically installs theio.delta:delta-spark-connect-common_2.13
package.The
delta-spark-connect-client_2.13
package uses the exact same class and package names as thedelta-spark_2.13
package. Therefore the exact same code can be used as before.Python Client
The Delta Connect Python client is included in the same PyPi package as Delta Spark.
There is no difference in usage compared to the classic way. We just need to pass in a remote SparkSession (instead of a local one) to the DeltaTable API.
The text was updated successfully, but these errors were encountered: