[Ray Data] Add filtering and column pruning when reading from BigQuery table #48821
Labels
data
Ray Data-related issues
enhancement
Request for new feature and/or capability
triage
Needs triage (eg: priority, bug/not-bug, and owning component)
Description
It would be great to have the ability to provide filters and columns to be read from the BQ table.
Use case
Existing implementation
As of now, I can run a code like this to get data from a table filtered and with only selected columns & filter conditions:
However, it will run this query and create temporary table introducing extra costs and delay before starting reading data.
Proposed option
On the other hand, BQ Read API supports providing filters and fields directly to the read request to the existing table, via TableReadOptions (parameters
selected_fields[]
androw_restriction
)So what I would like to have is to have an interface like this:
And these new fields to be propagated down to BQ Read API read request. In such case it will be streaming data directly from the existing table without extra costs and time spent on creating intermediate table.
The text was updated successfully, but these errors were encountered: