Moves scaling out of experimental #286
- h_spark
- h_dask
- h_ray

All move to plugins. We preserve the `h_...` names to avoid colliding with imports of the libraries themselves. Note that we also deprecate the idea of having "_implementations" modules. People will be importing these, so we want the names to be easy to refer to and remember. This is why polars_implementations is deprecated, and we instead use h_polars. We leave in references to all previously released constructs (note: the new pyspark API has not been released, so it's OK to remove that from the experimental.h_spark file).
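One way to leave previously released references in place while steering people to the new location is a thin shim that forwards attribute access and warns. The sketch below is only an illustration under assumptions: the module names `demo.plugins.h_demo` and `demo.experimental.h_demo` and the class `DemoGraphAdapter` are hypothetical stand-ins, and this is not necessarily how the PR implements it.

```python
import types
import warnings

# Hypothetical new home of the adapter (stands in for a module under plugins).
plugins_h_demo = types.ModuleType("demo.plugins.h_demo")


class DemoGraphAdapter:
    """Placeholder adapter class; not part of any real library."""


plugins_h_demo.DemoGraphAdapter = DemoGraphAdapter

# Hypothetical old location, kept as a thin shim so existing references resolve.
experimental_h_demo = types.ModuleType("demo.experimental.h_demo")


def _forward(name):
    # Module-level __getattr__ (PEP 562) runs only when normal lookup fails.
    if name.startswith("__"):
        raise AttributeError(name)
    warnings.warn(
        f"demo.experimental.h_demo.{name} has moved to demo.plugins.h_demo",
        DeprecationWarning,
        stacklevel=2,
    )
    return getattr(plugins_h_demo, name)


experimental_h_demo.__getattr__ = _forward

# Old-style access still works, but emits a DeprecationWarning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    legacy_cls = experimental_h_demo.DemoGraphAdapter
```

In a real module file the same effect comes from defining `__getattr__` at module level in the old file; the `types.ModuleType` objects here only keep the sketch self-contained.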
Need a README somewhere too (maybe in plugins) to explain how to add, adjust, and name files.
Plugins will get large, and we don't want very large files either.
So the evolution should be into packages, right?
h_spark.py -->
- `__init__.py`
- ... modules organized however we want
And then similarly for *_extensions.py...?
Yeah, I think that'll be the best way for it to evolve. Should be able to do it and keep compatible references later on, no need to break into packages yet.
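To make the "keep compatible references" point concrete, here is a minimal sketch of a single module growing into a package whose `__init__.py` re-exports the public names, so the import path callers use does not change. The package name `h_demo_pkg`, the submodule `adapters`, and the class `DemoAdapter` are hypothetical, and the package is written to a temporary directory purely so the example runs on its own.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Build a throwaway package on disk: h_demo_pkg/{__init__.py, adapters.py}.
root = Path(tempfile.mkdtemp())
pkg_dir = root / "h_demo_pkg"
pkg_dir.mkdir()

# The implementation now lives in a submodule...
(pkg_dir / "adapters.py").write_text("class DemoAdapter:\n    pass\n")

# ...and __init__.py re-exports it, so `h_demo_pkg.DemoAdapter` keeps working.
(pkg_dir / "__init__.py").write_text(
    "from .adapters import DemoAdapter\n\n__all__ = ['DemoAdapter']\n"
)

sys.path.insert(0, str(root))
importlib.invalidate_caches()
h_demo_pkg = importlib.import_module("h_demo_pkg")

# Callers still reference the top-level name, unaware of the internal split.
adapter_cls = h_demo_pkg.DemoAdapter
```

The trade-off is that every public name has to be listed in `__init__.py`, which doubles as a record of what the package promises to keep stable.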
This is for getting it to work with pyspark. While pyspark does have natural column/dataframe objects, these are not usable in the same way. Thus we don't want to register the same set of constructs as we would with, say, polars or pandas. We allow plugins to opt out by specifying COLUMN_FRIENDLY_DF_TYPE = False; the flag defaults to True.
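The opt-out can be pictured as a registration step that reads the flag with a default of True. This is a rough sketch, not the library's actual registry code: `REGISTRY`, `register_extension`, the `DATAFRAME_TYPE` and `COLUMN_TYPE` attribute names, and the fake types are all assumptions made for illustration; only `COLUMN_FRIENDLY_DF_TYPE` comes from the commit message.

```python
import types

# Hypothetical registry; the real registration code is not shown in this PR text.
REGISTRY = {}


def register_extension(name, ext):
    """Register dataframe constructs; add column constructs only if allowed."""
    # A missing flag means opted in, matching the stated default of True.
    column_friendly = getattr(ext, "COLUMN_FRIENDLY_DF_TYPE", True)
    constructs = {"dataframe_type": ext.DATAFRAME_TYPE}
    if column_friendly:
        constructs["column_type"] = ext.COLUMN_TYPE
    REGISTRY[name] = constructs
    return constructs


class FakeFrame:
    """Stand-in dataframe type."""


class FakeColumn:
    """Stand-in column type."""


# A pandas/polars-like plugin that does not set the flag at all.
pandas_like = types.SimpleNamespace(DATAFRAME_TYPE=FakeFrame, COLUMN_TYPE=FakeColumn)

# A pyspark-like plugin that opts out of column-level registration.
spark_like = types.SimpleNamespace(
    DATAFRAME_TYPE=FakeFrame,
    COLUMN_TYPE=FakeColumn,
    COLUMN_FRIENDLY_DF_TYPE=False,
)

register_extension("pandas_like", pandas_like)
register_extension("spark_like", spark_like)
```

Reading the flag with `getattr(..., True)` means existing plugins need no changes; only a plugin that wants to opt out has to declare anything.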
Pyspark has a bunch of hidden dependencies; these are resolved by installing the sql and connect targets.
I think things work.