Add max_results option to QueryJob.to_dataframe and QueryJob.to_arrow methods #296
Comments
@tswast Isn't rows_iterator = query_job.result()
rows_iterator.to_dataframe(max_results=42) The Also, |
I don't see python-bigquery/google/cloud/bigquery/client.py Lines 3140 to 3150 in 790d11b
I'm pretty sure that's the current behavior.
Oh! No, I didn't mean here. I meant on the |
(Looking at this, probably the solution in pandas-gbq is to call |
Ah, sorry, I meant the row iterator, yes.
I thought there was a reason for that in Pandas GBQ and that you wanted to pass Looks like we are almost set then, we just need to make sure that |
Currently, pandas-gbq calls QueryJob.result() and Client.list_rows() directly: https://github.com/pydata/pandas-gbq/blob/46c579ac21879b431c8568b49e68624f4a5ea25e/pandas_gbq/gbq.py#L561-L564
This is because the max_results parameter is needed, but is not available in to_dataframe.
Currently, this is not a problem except for some duplicate code, but it may keep pandas-gbq from benefiting from the "fast query path" changes currently being designed.
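For illustration, here is a minimal sketch of the workaround described above: cap the row count on the row iterator, then convert it. It assumes QueryJob.result() accepts a max_results argument, as in recent google-cloud-bigquery releases. The helper name query_to_dataframe is hypothetical and is not part of either library.

```python
def query_to_dataframe(client, sql, max_results=None):
    """Run `sql` and return at most `max_results` rows as a DataFrame.

    Hypothetical helper for illustration only. `client` is expected to
    behave like google.cloud.bigquery.Client.
    """
    # Start the query job.
    query_job = client.query(sql)
    # Wait for the job to finish; the row limit is applied to the
    # iterator because to_dataframe() has no max_results parameter.
    rows_iterator = query_job.result(max_results=max_results)
    # Convert whatever the capped iterator yields.
    return rows_iterator.to_dataframe()
```

With a real client this would be called as query_to_dataframe(bigquery.Client(), "SELECT ...", max_results=42). The option requested in this issue would let callers skip the intermediate row iterator.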