-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix cache path to snakecase for CachedDatasetModuleFactory
and Cache
#6754
Conversation
@lhoestq hi 😃, is there something else I need to do to check this change? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi ! Thanks for the fix :) I added a comment
I also created https://huggingface.co/datasets/hf-internal-testing/DatasetWithCapitalLetters if you want to use it to add a test in test_cache.py
(similar to test_cache_single_config
for example)
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
I added two tests and passed them on my server.
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
Hi @izhx! I have also faced this issue, happy to see it already addressed, looking forward for PR merge :) |
@lhoestq What do you think of these tests? 😀 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM ! thanks :)
Show benchmarksPyArrow==8.0.0 Show updated benchmarks!Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json
Show updated benchmarks!Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json
|
Fix #6750 (comment)
I didn't find a guideline on how to run the tests, so i just run the following steps to make sure that this bug is fixed.
python test.py
,HF_DATASETS_OFFLINE=1 python test.py
The
test.py
is