You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It should produce a longer list of examples to exhaust all the datasets.
If you comment out the shuffle line, it will exhaust all the datasets properly.
Here is the output if you comment out shuffling:
Describe the bug
Shuffling interleaved
IterableDataset
with "all_exhausted" strategy yields non-exhaustive sampling.Steps to reproduce the bug
This code produces
Expected behavior
It should produce a longer list of examples to exhaust all the datasets.
If you comment out the shuffle line, it will exhaust all the datasets properly.
Here is the output if you comment out shuffling:
Environment info
datasets
version: 2.12.0This was run on Google Colab.
The text was updated successfully, but these errors were encountered: