[SPARK-46293][CONNECT][PYTHON] Use `protobuf` transitive dependency #44221
@@ -161,6 +161,7 @@ Package Supported version Note
`numpy` >=1.21 Required for pandas API on Spark and MLLib DataFrame-based API; Optional for Spark SQL
`grpcio` >=1.59.3 Required for Spark Connect
`grpcio-status` >=1.59.3 Required for Spark Connect
`protobuf` ==4.25.1 Required for Spark Connect
The required version is taken from line 56 in e013c4e:
`protobuf==4.25.1`
You should fix https://github.com/apache/spark/blob/master/python/setup.py#L325C13-L331 too. In fact, I wouldn't add it for now, because `grpcio` and `grpcio-status` require a specific version of `protobuf`, and I think we won't have to duplicate the maintenance here. It is true that we use `protobuf` directly in a few places, but not really in a lot of places.
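One way to see what the comment above describes is to ask the installed distributions which `protobuf` requirement they declare. The following is a minimal sketch using only the standard library; the helper name `protobuf_requirements` is hypothetical, and what it prints depends entirely on the versions installed in your environment.

```python
from importlib.metadata import PackageNotFoundError, requires


def protobuf_requirements(dist_name):
    """Return the requirement strings of an installed distribution that start with 'protobuf'."""
    try:
        # requires() returns None when the distribution declares no requirements.
        declared = requires(dist_name) or []
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return []
    # Simple prefix match; this would also match other names beginning with "protobuf".
    return [req for req in declared if req.lower().startswith("protobuf")]


if __name__ == "__main__":
    for name in ("grpcio", "grpcio-status"):
        print(name, "->", protobuf_requirements(name))
```

An empty list means either that the distribution is not installed or that it does not declare `protobuf` itself, so it is worth checking both packages rather than assuming each one pulls it in.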
Otherwise we can close this ticket, but the one thing I'm worried about is this: if a new version of `grpcio` or `grpcio-status` requires a `protobuf` version other than 4.25.1, pinning the specific version could become a problem.
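The concern about pinning can be illustrated with a toy comparison. This sketch assumes plain dotted numeric versions and a single lower bound, which is far simpler than real PEP 440 resolution; the minimum versions passed in below are hypothetical values chosen for illustration, not requirements taken from any `grpcio` release.

```python
def parse_version(text):
    """Turn a dotted numeric version such as '4.25.1' into a comparable tuple."""
    return tuple(int(part) for part in text.split("."))


def pin_satisfies_minimum(pinned, minimum):
    """Check whether an exact pin still meets an upstream lower bound."""
    return parse_version(pinned) >= parse_version(minimum)


# Hypothetical lower bounds: the pin is fine for the first, but conflicts with the second.
print(pin_satisfies_minimum("4.25.1", "4.21.6"))
print(pin_satisfies_minimum("4.25.1", "5.26.1"))
```

Once an upstream lower bound moves past the pinned version, an installer cannot satisfy both constraints, which is the maintenance problem described above.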
Can we check whether we can remove the `protobuf` dependency and CI still passes? If that works, let's remove it.
If that doesn't work, we should fix both the doc here and `setup.py`.
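A quick local counterpart to the CI check suggested above is to confirm that the `google.protobuf` module is importable in an environment where `protobuf` was never requested explicitly. This is only a sketch, not what Spark's CI runs; the helper name `is_importable` is hypothetical.

```python
import importlib.util


def is_importable(module_name):
    """Return True if the module can be located without fully importing it."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (for example 'google') is missing.
        return False


if __name__ == "__main__":
    print("google.protobuf importable:", is_importable("google.protobuf"))
```

If this reports `True` after installing only `grpcio` and `grpcio-status`, `protobuf` arrived as a transitive dependency in that environment.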
Sure! Sounds reasonable to me.
Let me mark this PR as draft until CI is done.
Okay, it seems we can just remove the dependency. I just updated the PR title and description. @HyukjinKwon FYI
+1, LGTM. Ya, it seems to work fine with the transitive dependency.
To @itholic and @HyukjinKwon, I revised the PR title because the original PR title is misleading.
Merged to master for Apache Spark 4.
Makes sense to me. Thanks for revising!
### What changes were proposed in this pull request?

This PR proposes to remove `protobuf` from required package.

### Why are the changes needed?

`protobuf` is automatically installed when installing `grpcio` and `grpcio-status`, so we don't need to specify the specific version explicitly.

### Does this PR introduce _any_ user-facing change?

No API changes.

### How was this patch tested?

The existing CI should pass

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#44221 from itholic/protobuf_docs.

Authored-by: Haejoon Lee <haejoon.lee@databricks.com>
Signed-off-by: Dongjoon Hyun <dhyun@apple.com>