[Data] [Docs] Improve docs around Parquet filter predicate / column selection pushdown (ray-project#48095)

## Why are these changes needed?

Improve docs around Parquet filter predicate / column selection pushdown, so that they are easier to access from multiple parts of the Ray Data docs, and improve the clarity of examples.

Modified pages:
- [Loading Data](https://anyscale-ray--48095.com.readthedocs.build/en/48095/data/loading-data.html)
  <img width="1298" alt="reading_files" src="https://github.com/user-attachments/assets/c760ecd4-4cfe-4547-8b88-3026fa12a13a">
- [Performance tips](https://anyscale-ray--48095.com.readthedocs.build/en/48095/data/performance-tips.html#parquet-column-pruning-projection-pushdown)
  <img width="1311" alt="performance_tips" src="https://github.com/user-attachments/assets/1fc894ae-dabf-4c33-bc27-ba0dcc9fddff">
- [`Dataset.select_columns`](https://anyscale-ray--48095.com.readthedocs.build/en/48095/data/api/doc/ray.data.Dataset.select_columns.html#ray.data.Dataset.select_columns)
  <img width="1338" alt="select_columns" src="https://github.com/user-attachments/assets/483a4bf2-acc7-4ca3-90e1-3f66563b3365">
- [`Dataset.filter`](https://anyscale-ray--48095.com.readthedocs.build/en/48095/data/api/doc/ray.data.Dataset.filter.html#ray.data.Dataset.filter)
  <img width="1319" alt="filter" src="https://github.com/user-attachments/assets/a9b1beb6-5b7c-415f-97b9-c2119513adde">

## Related issue number

## Checks

- [x] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR.
- [x] I've run `scripts/format.sh` to lint the changes in this PR.
- [x] I've included any doc changes needed for https://docs.ray.io/en/master/.
    - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file.
- [x] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
    - [ ] Unit tests
    - [ ] Release tests
    - [ ] This PR is not tested :(

---------

Signed-off-by: Scott Lee <sjl@anyscale.com>