-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow users compute statistics over retrieved batch datasets #799
Conversation
/test test-end-to-end-batch |
.../src/main/java/feast/storage/connectors/bigquery/statistics/BigQueryStatisticsRetriever.java
Outdated
Show resolved
Hide resolved
.../connectors/bigquery/src/main/java/feast/storage/connectors/bigquery/statistics/Dataset.java
Outdated
Show resolved
Hide resolved
/test test-end-to-end-batch |
1 similar comment
/test test-end-to-end-batch |
1d3a141
to
a120401
Compare
… retrieved batch datasets
f404741
to
e8ae893
Compare
/test test-end-to-end-batch |
7cbbc7b
to
d2c2b97
Compare
8779770
to
6ca725c
Compare
0155066
to
7c93e2f
Compare
/test test-end-to-end-redis-cluster |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: woop, zhilingc The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/kind feature |
What this PR does / why we need it:
This PR adds the functionality to allow users to optionally compute tfdv statistics over retrieved batch datasets:
Like with the feature set statistics, these statstics can then be fed into tfdv for validation, or facets to explore the dataset.
Which issue(s) this PR fixes:
Fixes #
Does this PR introduce a user-facing change?: