Skip to content

Releases: tensorflow/datasets

v4.9.7

30 Oct 12:58
Compare
Choose a tag to compare

Added

  • New datasets.

Changed

  • CroissantBuilder's API to generate TFDS datasets from Croissant files.

Deprecated

Removed

Fixed

  • Versions for existing datasets.

Security

v4.9.6

05 Jun 08:15
Compare
Choose a tag to compare

Added

  • Full support for Python 3.12.

v4.9.5

30 May 08:37
Compare
Choose a tag to compare

Added

  • Support to download and prepare datasets using the
    Parquet data format.

    builder = tfds.builder('fashion_mnist', file_format='parquet')
    builder.download_and_prepare()
    ds = builder.as_dataset(split='train')
    print(next(iter(ds)))
  • tfds.data_source
    is pickable, thus working smoothly with
    PyGrain. Learn more by following the
    tutorial.

  • TFDS plays nicely with
    Croissant. Learn more by
    following the
    recipe.

Changed

Deprecated

Removed

Fixed

Security

v4.9.4

18 Dec 13:28
Compare
Choose a tag to compare

Added

  • A new CroissantBuilder
    which initializes a DatasetBuilder based on a Croissant
    metadata file.
  • New conversion options between different bounding boxes formats.
  • Better support for HuggingfaceDatasetBuilder.
  • A script
    to convert a dataset from one format to another.

Changed

Deprecated

  • Python 3.9 support. TFDS now uses Python 3.10

Removed

Fixed

Security

v4.9.3

08 Sep 09:07
Compare
Choose a tag to compare

Added

Changed

  • Hugging Face datasets accept None values for any features. TFDS has no
    tfds.features.Optional, so None values are converted to default values.
    Those default values used to be 0 and 0.0 for int and float. Now, it's
    -inf as defined by NumPy (e.g., np.iinfo(np.int32).min or
    np.finfo(np.float32).min). This avoids ambiguous values when 0 and 0.0
    exist in the values of the dataset. The roadmap is to implement
    tfds.features.Optional.

Deprecated

  • Python 3.8 support. As per
    NEP 29, TFDS now
    uses Python>=3.9.

Removed

Fixed

Security

v4.9.2

13 Apr 11:21
Compare
Choose a tag to compare

Added

  • [Experimental] A list of freeform text tags can now be attached to a
    BuilderConfig. For example:
    BUILDER_CONFIGS = [
        tfds.core.BuilderConfig(name="foo", tags=["foo", "live"]),
        tfds.core.BuilderConfig(name="bar", tags=["bar", "old"]),
    ]
    The tags are recorded with the dataset metadata and can later be retrieved
    using the info object:
    builder.info.config_tags  # ["foo", "live"]
    This feature is experimental and there are no guidelines on tags format.

Changed

Deprecated

Removed

Fixed

  • Fixed generated proto files (see issue 4858).

Security

v4.9.1

11 Apr 13:16
Compare
Choose a tag to compare

Added

Changed

Deprecated

Removed

Fixed

  • The installation on macOS now works (see issues
    4805 and
    4852). The ArrayRecord
    dependency is lazily loaded, so the
    TensorFlow-less path is
    not possible at the moment on macOS. A fix for this will follow soon.

Security

v4.9.0

05 Apr 07:30
Compare
Choose a tag to compare

Added

Changed

  • Support for tensorflow=2.12.

Deprecated

Removed

Fixed

Security

v4.8.3

27 Feb 11:46
Compare
Choose a tag to compare

Added

Changed

Deprecated

  • Python 3.7 support: this version and future version use Python 3.8.

Removed

Fixed

  • Flag ignore_verifications from Hugging Face's datasets.load_dataset is
    deprecated, and used to cause errors in tfds.load(huggingface:foo).

Security

v4.8.2

17 Jan 20:41
Compare
Choose a tag to compare

Deprecated

  • Python 3.7 support: this is the last version of TFDS supporting Python 3.7.
    Future versions will use Python 3.8.

Fixed

  • tfds new and tfds build better support the new recommended datasets
    organization, where individual datasets have their own package under
    datasets/, builder class is called Builder and is defined within module
    ${dsname}_dataset_builder.py.

Security