-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Creating an array from a pandas object column with mostly ints raises #3479
Comments
On the first question, that would be a question for the pandas project cc @jreback @TomAugspurger On the second question, what are you hoping to happen? We have discussed having an option to treat errors as nulls (https://issues.apache.org/jira/browse/ARROW-2098), so one possibility if you indicate that the column should be an integer, is that the lone string should be made null. You'd have to opt in to this behavior, though |
Thanks for the lightning-fast response @wesm
I was hoping that pyarrow could see that the column had an Is that reasonable? |
Pandas doesn't currently pass a That seems generally useful, but it wouldn't have helped with this exception right? Edit: Oh, reading #3479 (comment) I see that it may have helped if you opt into it. |
On the pandas / first question:
I think a user could now pass |
@max-sixty I thought that at first, but that |
Right, thanks @TomAugspurger |
I'm closing this for now. If you have a well-scoped feature request can you please open a JIRA issue? |
It looks like an object column with mostly ints is interpreted as a column with all ints:
Tracking down the call within the pyarrow library:
Thanks!
The text was updated successfully, but these errors were encountered: