Commit
feat(datasets): Add NetCDFDataSet class (kedro-org#360)
* initialize template and early additions Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add placeholder for remote file system load Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* switch to versioned dataset Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add initial remote -> local get for S3 Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* further generalize remote retrieval Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add in credentials Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* make temppath optional for remote datasets Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add initial idea for multifile glob Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* style: Introduce `ruff` for linting in all plugins. (kedro-org#354) Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add suggested style changes Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add temppath to attributes Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* more temppath fixes Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* more temppath updates Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add better tempfile deletion and work on saving files Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* make __del__ flexible Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* formatting Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* feat(datasets): create custom `DeprecationWarning` (kedro-org#356)
  * feat(datasets): create custom `DeprecationWarning` Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
  * feat(datasets): use the custom deprecation warning Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
  * chore(datasets): show Kedro's deprecation warnings Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
  * fix(datasets): remove unused imports in test files Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
  --------- Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* docs(datasets): add note about DataSet deprecation (kedro-org#357) Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* test(datasets): skip `tensorflow` tests on Windows (kedro-org#363) Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* ci: Pin `tables` version (kedro-org#370)
  * Pin tables version Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
  * Also fix kedro-airflow Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
  * Revert trying to fix airflow Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
  --------- Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* build(datasets): Release `1.7.1` (kedro-org#378) Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* docs: Update CONTRIBUTING.md and add one for `kedro-datasets` (kedro-org#379) Update CONTRIBUTING.md + add one for kedro-datasets Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* ci(datasets): Run tensorflow tests separately from other dataset tests (kedro-org#377) Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* feat: Kedro-Airflow convert all pipelines option (kedro-org#335)
  * feat: kedro airflow convert --all option Signed-off-by: Simon Brugman <sfbbrugman@gmail.com>
  * docs: release docs Signed-off-by: Simon Brugman <sfbbrugman@gmail.com>
  --------- Signed-off-by: Simon Brugman <sfbbrugman@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* docs(datasets): blacken code in rst literal blocks (kedro-org#362) Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* docs: cloudpickle is an interesting extension of the pickle functionality (kedro-org#361) Signed-off-by: H. Felix Wittmann <hfwittmann@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* fix(datasets): Fix secret scan entropy error (kedro-org#383) Fix secret scan entropy error Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* style: Rename mentions of `DataSet` to `Dataset` in `kedro-airflow` and `kedro-telemetry` (kedro-org#384) Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* feat(datasets): Migrated `PartitionedDataSet` and `IncrementalDataSet` from main repository to kedro-datasets (kedro-org#253) Signed-off-by: Peter Bludau <ptrbld.dev@gmail.com> Co-authored-by: Merel Theisen <merel.theisen@quantumblack.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* fix: backwards compatibility for `kedro-airflow` (kedro-org#381) Signed-off-by: Simon Brugman <sfbbrugman@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* fix(datasets): Don't warn for SparkDataset on Databricks when using s3 (kedro-org#341) Signed-off-by: Alistair McKelvie <alistair.mckelvie@gmail.com> Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* update docs API and release notes Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add netcdf requirements to setup Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* lint Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add initial tests Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* update dataset exists for multifile Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* Add full test suite for NetCDFDataSet Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* Add docstring examples Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* change xarray version req Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* update dask req Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* rename DataSet -> Dataset Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* Update xarray reqs for earlier python versions Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* fix setup Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* update test coverage Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* exclude init from test coverage Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* Sub in pathlib for os.remove Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add metadata to dataset Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* add doctest for the new datasets Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
* add patch for supporting http/https Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
* Small fixes post-merge Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Lint Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Fix import Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Un-ignore NetCDF doctest Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Add fixture Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
* Mark problematic test as xfail Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Skip problematic test instead of making it fail Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
* Skip problematic tests and fix failing tests Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
* Remove comment Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>

---------

Signed-off-by: Riley Brady <riley_brady@mckinsey.com>
Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com>
Signed-off-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
Signed-off-by: Simon Brugman <sfbbrugman@gmail.com>
Signed-off-by: H. Felix Wittmann <hfwittmann@gmail.com>
Signed-off-by: Peter Bludau <ptrbld.dev@gmail.com>
Signed-off-by: Alistair McKelvie <alistair.mckelvie@gmail.com>
Signed-off-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>
Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>
Co-authored-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>
Co-authored-by: Simon Brugman <sbrugman@users.noreply.github.com>
Co-authored-by: Felix Wittmann <hfwittmann@users.noreply.github.com>
Co-authored-by: PtrBld <7523956+PtrBld@users.noreply.github.com>
Co-authored-by: Merel Theisen <merel.theisen@quantumblack.com>
Co-authored-by: Alistair McKelvie <alistair.mckelvie@gmail.com>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Co-authored-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Co-authored-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
Signed-off-by: tgoelles <thomas.goelles@gmail.com>