Progress bar missing with litdata.StreamingDataset and wrong number of steps in an epoch #112
🐛 Bug
There are two separate issues here.

When training a model using `litdata.StreamingDataset`, the `tqdm` progress bar shows `{steps}/?` and the estimated time is missing.

Moreover, the total number of steps in an epoch seems to be independent of the number of GPUs. Instead of `total_steps = num_samples / (num_gpus * batch_size)`, the log reports `total_steps = num_samples / batch_size`.
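To make the discrepancy concrete, here is a minimal sketch of the two step counts. The sample count, batch size, and GPU count are made-up values, and the helper `steps_per_epoch` is hypothetical (it is not part of litdata or Lightning). It also assumes the last partial batch is kept by default, which the formulas above do not specify.

```python
import math


def steps_per_epoch(num_samples, batch_size, num_gpus=1, drop_last=False):
    """Hypothetical helper: optimizer steps in one epoch when every GPU
    consumes its own batch of `batch_size` samples per step."""
    samples_per_step = batch_size * num_gpus
    if drop_last:
        # Discard the final incomplete step.
        return num_samples // samples_per_step
    # Keep the final incomplete step.
    return math.ceil(num_samples / samples_per_step)


# Illustrative values only, not taken from the report.
num_samples, batch_size, num_gpus = 10_000, 32, 4

# What the report expects: samples are split across all GPUs.
expected = steps_per_epoch(num_samples, batch_size, num_gpus)

# What the report observes: the GPU count is ignored.
observed = steps_per_epoch(num_samples, batch_size, num_gpus=1)

print(f"expected steps per epoch: {expected}")
print(f"observed steps per epoch: {observed}")
```

With these values the observed count is roughly `num_gpus` times larger than the expected one, which matches the symptom described above.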
Expected behavior
The progress bar should show the estimated time and the fraction of steps that have been completed.

The number of steps in an epoch should be `total_steps = num_samples / (num_gpus * batch_size)`.
cc @tchaton