0.3.0
What's Changed
- [DataComp] Add cluster component by @NielsRogge in #239
- Enable building specified components by @PhilippeMoussalli in #265
- Order output columns in PandasTransformComponent by @RobbeSneyders in #276
- Always pull images in local runner by @RobbeSneyders in #279
- Fix test warnings by @RobbeSneyders in #280
- Large scale controlnet by @PhilippeMoussalli in #260
- Make components cloud agnostic by @PhilippeMoussalli in #281
- Bump jsonschema version to 4.18.0 by @RobbeSneyders in #284
- Run tests against fondant package with tox by @RobbeSneyders in #283
- [LLM pipeline] Add filter out short texts component by @mrchtr in #247
- Fix running tox on the inferior OS by @GeorgesLorre in #287
- Update getting_started.md by @janvanlooy in #286
- Add defaults to components by @PhilippeMoussalli in #289
- Remove obsolete packages by @PhilippeMoussalli in #293
- Update pre-commit config with new folder structure by @GeorgesLorre in #294
- Add fsspec as explicit dependency by @RobbeSneyders in #299
- Revert src/fondant/components after testing with tox by @RobbeSneyders in #298
- Don't use from_registry for generic components by @RobbeSneyders in #285
- [LLM pipeline] MinHash generation for deduplication by @mrchtr in #295
- Split component implementation and execution by @RobbeSneyders in #302
- Bugfix default 0 values by @PhilippeMoussalli in #304
- Update script to work with macOS by @GeorgesLorre in #308
- Bugfix: Data explorer local runner usage by @mrchtr in #307
- Add --build-arg argument to compile and run commands by @RobbeSneyders in #306
- Bugfix: data explorer artifact mounting by @mrchtr in #310
- [Commoncrawl pipeline] Add component extract free-to-use images by @shayorshay in #282
- Introduce repartitioning by @PhilippeMoussalli in #309
- Bugfix/partitioning by @PhilippeMoussalli in #312
- Add code for reusable load from files component #290 by @satishjasthi in #296
- Unify manifest save path by @PhilippeMoussalli in #322
- Bugfix basepath by @PhilippeMoussalli in #324
- Add test cases for caption_images component and fixed bug in this com… by @satishjasthi in #311
- Remove local images in build script to conserve space by @GeorgesLorre in #326
- Change base image to smaller version by @GeorgesLorre in #330
- [Scripts] Fix build_components by @NielsRogge in #332
- Change subset merging method by @PhilippeMoussalli in #334
- Add node pool label by @shayorshay in #327
- Update docs link to stable version by @RobbeSneyders in #336
- Add int64 dtype by @NielsRogge in #338
- [load_from_hf_hub] Add dataset_length, set_index by @NielsRogge in #339
New Contributors
- @janvanlooy made their first contribution in #286
- @satishjasthi made their first contribution in #296
Full Changelog: 0.2.1...0.3.0