Make inspect.get_dataset_config_names always return a non-empty list of configs #3135

severo · 2021-10-22T08:02:50Z

Is your feature request related to a problem? Please describe.

Currently, some datasets have named configurations, while others don't. It would be simpler for the user to always have configuration names to refer to.

Describe the solution you'd like

In that sense inspect.get_dataset_config_names should always return at least one configuration name, be it default or Check___region_1 (for community datasets like Check/region_1).

datasets/src/datasets/inspect.py, line 161 in c5747a5:

def get_dataset_config_names(
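To make the request concrete, here is a minimal sketch of the fallback behaviour described above. It is not the library's implementation: `fallback_config_name` and `ensure_config_names` are hypothetical helpers, and the only naming rules assumed are the two mentioned in this issue (`default`, or the dataset id with `/` replaced by `___` for community datasets).

```python
from typing import Iterable, List


def fallback_config_name(dataset_id: str) -> str:
    """Hypothetical helper: name to use when a dataset declares no configs."""
    # Community datasets are namespaced ("Check/region_1"), so the slash is
    # flattened to "___", as in the example from this issue.
    if "/" in dataset_id:
        return dataset_id.replace("/", "___")
    # Datasets without a namespace fall back to "default".
    return "default"


def ensure_config_names(dataset_id: str, declared_names: Iterable[str]) -> List[str]:
    """Hypothetical helper: always return at least one config name."""
    names = list(declared_names)
    return names if names else [fallback_config_name(dataset_id)]
```

With these assumptions, a dataset with named configs keeps its own names, while `acronym_identification` and `Check/region_1` would each get a single-element list.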


albertvillanova · 2021-10-25T08:23:44Z

Hi @severo, I take it this issue asks not only for access to the configuration name (via inspect.get_dataset_config_names) but also to the configuration itself (that is, you would use the name to get the configuration afterwards, perhaps through builder_cls.builder_configs). Is that right?

severo · 2021-10-25T09:11:33Z

Yes, maybe the issue could be reformulated. As a user, I want to avoid having to manage special cases:

I want to be able to get the names of a dataset's configs and use them in the rest of the API (get the data, get the split names, etc.).
I don't want to have to manage datasets with named configs (glue) differently from datasets without named configs (acronym_identification, Check/region_1).
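The two points above amount to being able to write one loop that works for every dataset. A rough sketch, assuming the config lister never returns an empty list: `splits_by_config` is a hypothetical helper, and the two callables are injected so the example runs offline. In practice one would presumably pass `datasets.get_dataset_config_names` and `datasets.get_dataset_split_names`.

```python
from typing import Callable, Dict, List


def splits_by_config(
    dataset_id: str,
    get_config_names: Callable[[str], List[str]],
    get_split_names: Callable[[str, str], List[str]],
) -> Dict[str, List[str]]:
    """Hypothetical helper: map every config name to its split names."""
    # No special case for "dataset without named configs": the loop relies on
    # get_config_names returning at least one name for every dataset.
    return {
        config: list(get_split_names(dataset_id, config))
        for config in get_config_names(dataset_id)
    }
```

The same call then covers glue, acronym_identification and Check/region_1 alike, which is the uniformity asked for here.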

severo added the enhancement (New feature or request) label Oct 22, 2021

severo mentioned this issue Oct 22, 2021

Cannot get the config names for some datasets huggingface/dataset-viewer#78

Closed

severo added the dataset-viewer (Related to the dataset viewer on huggingface.co) label Oct 22, 2021

severo changed the title ~~Make inspect.get_dataset_config_names always return a nen-empty list of configs~~ Make inspect.get_dataset_config_names always return a non-empty list of configs Oct 22, 2021

albertvillanova self-assigned this Oct 25, 2021

albertvillanova mentioned this issue Oct 25, 2021

Make inspect.get_dataset_config_names always return a non-empty list #3159

Merged

albertvillanova closed this as completed in #3159 Oct 28, 2021
