Fix indexing with datetime64[ns] with pandas=1.1 #4292

shoyer · 2020-07-31T00:48:50Z

The underlying issue is that calling .item() on a NumPy array with
dtype=datetime64[ns] returns an integer, rather than an np.datetime64
scalar. This is somewhat baffling but works this way because .item()
returns native Python types, but datetime.datetime doesn't support
nanosecond precision.

pandas.Index.get_loc used to support these integers, but now is more strict.
Hence we get errors.

We can fix this by using array[()] to convert 0d arrays into NumPy scalars
instead of calling array.item().

I've added a crude regression test. There may well be a better way to test this
but I haven't figured it out yet.

Tests added
Passes isort . && black . && mypy . && flake8

Fixes pydata#4283 The underlying issue is that calling `.item()` on a NumPy array with `dtype=datetime64[ns]` returns an _integer_, rather than an `np.datetime64 scalar. This is somewhat baffling but works this way because `.item()` returns native Python types, but `datetime.datetime` doesn't support nanosecond precision. `pandas.Index.get_loc` used to support these integers, but now is more strict. Hence we get errors. We can fix this by using `array[()]` to convert 0d arrays into NumPy scalars instead of calling `array.item()`. I've added a crude regression test. There may well be a better way to test this but I haven't figured it out yet.

pep8speaks · 2020-07-31T00:49:06Z

Hello @shoyer! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-09-16 00:31:04 UTC

keewis · 2020-08-05T12:25:10Z

you should be able to isolate this to just indexing.convert_label_indexer, e.g. with this in test_indexing.py:

    def test_convert_label_indexer_datetime(self):
        index = pd.to_datetime(["2000-01-01", "2001-01-01", "2002-01-01"])
        actual = indexing.convert_label_indexer(index, "2001-01-01")
        expected = (1, None)
        assert actual == expected

        actual = indexing.convert_label_indexer(index, index.to_numpy()[1])
        assert actual == expected

The failing tests are due to label[()] returning a numpy.str_ instead of a plain python str. Maybe we can fix that by using item as long as the dtype is not "datetime64" or "timedelta64":

if label.dtype.kind in "mM":
    label_value = label[()]
else:
    label_value = label.item()

Edit: stable is affected by this, too

xarray/core/indexing.py

keewis · 2020-08-28T09:13:42Z

there are lots of people that stumble into this, so I think it might be good to get this to work as soon as possible and issue a bugfix release.

keewis · 2020-09-06T10:25:56Z

gentle ping, @shoyer. Any updates on this?

keewis · 2020-09-13T10:58:59Z

it seems we have to cast because label may also be something like a Variable object, which is not accepted by index.get_loc. I pushed my original fix (falling back to .item() for non-datetime / timedelta dtypes), I hope that's okay?

keewis · 2020-09-13T12:29:44Z

it seems pandas warns about our usage of pandas.Grouper:

/home/docs/checkouts/readthedocs.org/user_builds/xray/checkouts/4292/xarray/core/common.py:1134: FutureWarning: 'base' in .resample() and in Grouper() is deprecated.
The new arguments that you should use are 'offset' or 'origin'.

>>> df.resample(freq="3s", base=2)

becomes:

>>> df.resample(freq="3s", offset="2s")

  grouper = pd.Grouper(

this was introduced in 1.1.0. We're supporting pandas>=0.25 (maybe even >=0.24) so we can't switch yet. I added a warning filter and a todo comment, but we might also need a tracking issue.

xarray/core/indexing.py

dcherian · 2020-09-15T20:09:10Z

I am +1 on merging this quickly and issuing a bugfix release.

We can always make cleanups later...

Co-authored-by: keewis <keewis@users.noreply.github.com>

shoyer · 2020-09-16T01:34:00Z

OK, submitting this!

Thanks @keewis for making this fix actually work :)

seth-p · 2020-09-16T01:34:08Z

Does this fix #4363?

shoyer · 2020-09-16T01:45:35Z

Does this fix #4363?

No, that seems to be unrelated

max-sixty · 2020-09-16T03:11:48Z

I can do a patch release this weekend

lint fix

2fb13ee

This was referenced Aug 4, 2020

Fix map_blocks examples #4305

Merged

Indexing datetime broken with pandas 1.1.0 #4306

Closed

shoyer commented Aug 5, 2020

View reviewed changes

xarray/core/indexing.py Show resolved Hide resolved

This was referenced Aug 5, 2020

silence the known docs CI issues #4316

Merged

KeyError when faceting along time dimensions #4319

Closed

dcamron mentioned this pull request Aug 21, 2020

Getting python-training back to buiding Unidata/python-training#111

Merged

keewis mentioned this pull request Aug 23, 2020

Not able to slice dataset using its own coordinate value, after upgrade to pandas 1.1.0 #4370

Closed

philippjfr mentioned this pull request Aug 26, 2020

Time slider widget broken for pandas > 1.0.5 holoviz/hvplot#500

Closed

seth-p mentioned this pull request Aug 26, 2020

Indexing a datetime64[ns] coordinate with a scalar datetime.date produces a KeyError #4363

Open

add a test checking the datetime indexer

f367a6b

keewis added 2 commits September 13, 2020 12:46

use label.item() for non-datetime / timedelta labels

81a7250

Merge branch 'master' into datetime64-indexing-fix

ef98c32

keewis force-pushed the datetime64-indexing-fix branch from 65f6b9a to ef98c32 Compare September 13, 2020 10:46

unpin pandas in the docs

5941cc9

ignore the future warning about deprecated arguments to pandas.Grouper

a840e69

max-sixty reviewed Sep 15, 2020

View reviewed changes

xarray/core/indexing.py Show resolved Hide resolved

max-sixty and others added 2 commits September 15, 2020 17:22

Update xarray/core/indexing.py

61aefcc

Co-authored-by: keewis <keewis@users.noreply.github.com>

Add whatsnew note

0fb7005

shoyer merged commit 59f57f3 into pydata:master Sep 16, 2020

ChristopheLRTE mentioned this pull request Nov 28, 2021

Error about to_grib ecmwf/cfgrib#267

Closed

spencerkclark mentioned this pull request Nov 13, 2022

⚠️ Nightly upstream-dev CI failed ⚠️: pandas removed deprecated keyword arguments #7266

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix indexing with datetime64[ns] with pandas=1.1 #4292

Fix indexing with datetime64[ns] with pandas=1.1 #4292

shoyer commented Jul 31, 2020 •

edited by fujiisoup

Loading

pep8speaks commented Jul 31, 2020 •

edited

Loading

keewis commented Aug 5, 2020 •

edited

Loading

keewis commented Aug 28, 2020

keewis commented Sep 6, 2020

keewis commented Sep 13, 2020 •

edited

Loading

keewis commented Sep 13, 2020

dcherian commented Sep 15, 2020

shoyer commented Sep 16, 2020

seth-p commented Sep 16, 2020

shoyer commented Sep 16, 2020

max-sixty commented Sep 16, 2020

Fix indexing with datetime64[ns] with pandas=1.1 #4292

Fix indexing with datetime64[ns] with pandas=1.1 #4292

Conversation

shoyer commented Jul 31, 2020 • edited by fujiisoup Loading

pep8speaks commented Jul 31, 2020 • edited Loading

Comment last updated at 2020-09-16 00:31:04 UTC

keewis commented Aug 5, 2020 • edited Loading

keewis commented Aug 28, 2020

keewis commented Sep 6, 2020

keewis commented Sep 13, 2020 • edited Loading

keewis commented Sep 13, 2020

dcherian commented Sep 15, 2020

shoyer commented Sep 16, 2020

seth-p commented Sep 16, 2020

shoyer commented Sep 16, 2020

max-sixty commented Sep 16, 2020

shoyer commented Jul 31, 2020 •

edited by fujiisoup

Loading

pep8speaks commented Jul 31, 2020 •

edited

Loading

keewis commented Aug 5, 2020 •

edited

Loading

keewis commented Sep 13, 2020 •

edited

Loading