-
-
Notifications
You must be signed in to change notification settings - Fork 18.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
API/DEPR: dtype=(str|bytes) interpret as pyarrow #52429
Comments
Another option is to interpret |
I'm also not clear (up-to-date) on what the thinking is here. (hence my comment in #52509 (comment)) |
A simple case I ran into today where the
|
@datapythonista both here and in #52711 a request has been made to explain how great pyarrow string dtypes are. Want to sing their praises? |
Looking at #35864, looks like "zfill" isnt implemented in arrow yet so is slightly slower, but other string methods mentioned later in the thread outperform quite a bit:
|
xref #49398 |
For the
With PDEP-14 accepted, the idea is that |
In 2.0 we made a lot of progress in ensuring passing dtype=foo or .astype(foo) actually returned the requested dtype rather than silently giving something else. bytes and str are the main remaining cases where we silently do something else (cast to object, but not as consistently as intended).
Instead, let's interpret
dtype=str
asstring[pyarrow]
anddtype=bytes
asbytes[pyarrow]
(with a deprecation cycle, and once we require pyarrow)The text was updated successfully, but these errors were encountered: