Decoding times in num2date exactly with timedelta arithmetic #171

spencerkclark · 2020-05-10T15:55:23Z

Timedelta arithmetic in cftime uses integer arithmetic, which is precise down to the microsecond. If passed an array of integers, or an array of floats that can safely be cast to integers, would it be possible for num2date to default to decoding times using that instead of the current imprecise floating point method? This would be a helpful improvement to the user experience in downstream libraries like xarray, where microsecond noise introduced in the times by floating point arithmetic can interfere with automatic alignment. See also other issues in this repo related to decoding precision, e.g. #78, #134.

This gist is a simplified sketch of how this code path could be implemented (I'm happy to develop it further into a PR if it seems promising). The idea would be to try to use this method first, and fall back to using the existing floating point method if it wasn't possible. The only reservation I would have about this is that it is about 9x slower than decoding times using floats. In my use-cases, however, this slowdown would gladly be accepted if it meant times would be decoded exactly, but perhaps others decoding more than 10000 times at a time would find this frustrating.

Are there any other reasons I'm not thinking of for why this might not be a good idea?

The text was updated successfully, but these errors were encountered:

jswhit · 2020-05-11T15:05:51Z

@spencerkclark - a reworking of the date calculations is long overdue, so a pull request would be welcome. Probably best if it's implemented as an non-default option at first (especially if it's 10x slower).

spencerkclark · 2020-05-11T15:22:24Z

Sounds good -- when I get a chance I'll try and put something more polished together.

jswhit · 2020-05-29T15:08:37Z

Is it possible to have an exact inverse of this (date2num_exact) so the round-trip works without loss of precision?

jswhit · 2020-05-29T21:32:41Z

Something like this should work as an inverse:

def date2num_exact(dates,units,calendar='standard'):
    calendar = calendar.lower()
    basedate = to_calendar_specific_datetime(_dateparse(units), calendar)
    unit, ignore = _datesplit(units)
    if unit not in UNIT_CONVERSION_FACTORS:
        raise ValueError("Unsupported time units provided, {!r}.".format(unit))
    if unit in ["months", "month"] and calendar != "360_day":
        raise ValueError("Units of months only valid for 360_day calendar.")
    factor = UNIT_CONVERSION_FACTORS[unit]
    deltas = dates-basedate
    times = (deltas/timedelta(microseconds=1)) / factor
    return times

with some logic to handle masked arrays?

spencerkclark · 2020-05-29T22:59:18Z

Is it possible to have an exact inverse of this (date2num_exact) so the round-trip works without loss of precision?

That would be great too. The tricky part there is that currently subtracting a date from another to produce a timedelta is not microsecond-exact:

In [1]: import cftime

In [2]: cftime.DatetimeNoLeap(2000, 1, 2, 0, 0, 0, 5) - cftime.DatetimeNoLeap(2000, 1, 2)
Out[2]: datetime.timedelta(microseconds=8)

This is another issue I'd be keen to address. I haven't thought about the best way to do this. We currently use a bit of a hack in xarray to work around this -- recall the function in this issue -- but maybe there's a more elegant / direct way.

spencerkclark mentioned this issue May 25, 2020

Update num2date function to decode times exactly using timedelta addition #176

Merged

spencerkclark mentioned this issue Jun 21, 2020

Make cftime.datetime - cftime.datetime -> timedelta microsecond-exact #178

Merged

jswhit closed this as completed in #176 Jun 22, 2020

rcomer mentioned this issue Jun 30, 2021

Adopt microsecond precision in num2date and date2num SciTools/cf-units#184

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decoding times in num2date exactly with timedelta arithmetic #171

Decoding times in num2date exactly with timedelta arithmetic #171

spencerkclark commented May 10, 2020

jswhit commented May 11, 2020

spencerkclark commented May 11, 2020

jswhit commented May 29, 2020

jswhit commented May 29, 2020 •

edited

Loading

spencerkclark commented May 29, 2020

Decoding times in num2date exactly with timedelta arithmetic #171

Decoding times in num2date exactly with timedelta arithmetic #171

Comments

spencerkclark commented May 10, 2020

jswhit commented May 11, 2020

spencerkclark commented May 11, 2020

jswhit commented May 29, 2020

jswhit commented May 29, 2020 • edited Loading

spencerkclark commented May 29, 2020

jswhit commented May 29, 2020 •

edited

Loading