DEPR: GH10623 remove items from msgpack.encode for blocks #12129
Conversation
Hmm, the legacy file for 0.17.1 is not passing. I think it's tz-related. Will check later.
Passing now. Had to make changes for
            'locs': b.mgr_locs.as_array,
            'values': convert(b.values),
            'blocks': [{'locs': b.mgr_locs.as_array,
                        'values': convert_if_not_dt_index(b.values),
This is way hacky. You shouldn't need to specifically do this (test for DatetimeIndex); it should just be converted normally. The entire difference here is the dispatch on the dtype, which in this case would be a string.
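A rough sketch of that dispatch (hypothetical helper name, not the PR's code; the import location of DatetimeTZDtype varies across pandas versions): the recorded dtype string decides how the raw buffer is rebuilt, so the encode side never needs an explicit DatetimeIndex check.

import numpy as np
from pandas import DatetimeIndex, DatetimeTZDtype

def rebuild_values(data, dtype_str):
    # tz-aware dtypes stringify as e.g. 'datetime64[ns, US/Eastern]'
    try:
        tz_dtype = DatetimeTZDtype.construct_from_string(dtype_str)
    except TypeError:
        # anything else is a plain numpy dtype; decode the buffer directly
        return np.frombuffer(data, dtype=np.dtype(dtype_str))
    # tz-aware: values are i8 nanoseconds in UTC, localize then convert
    i8 = np.frombuffer(data, dtype='i8')
    return DatetimeIndex(i8).tz_localize('UTC').tz_convert(tz_dtype.tz)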
If it gets converted to a numpy array now, wouldn't the whole encode/decode for DatetimeIndex be skipped? Or is there a way to construct a DatetimeTZBlock from numpy arrays?
Actually, I think you could trivially change make_block in core/internals.py to take a DatetimeTZDtype and correctly create it. Then you would have to make sure to serialize that properly (as a string is prob ok), and you would need to fix the reverse lookup, as it only deals with numpy dtypes atm. I think the same problem exists with categorical; we're prob not testing it.
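A hedged sketch of the dtype round-trip being suggested (hypothetical helper names, not the PR's code): serialize the dtype as its string form, and on decode turn the string back into a DatetimeTZDtype so make_block can build the tz-aware block; anything unrecognized falls back to the numpy lookup.

import numpy as np
from pandas import DatetimeTZDtype

def pack_dtype(block):
    # str() of a tz-aware dtype looks like 'datetime64[ns, US/Eastern]'
    return str(block.dtype)

def unpack_dtype(dtype_str):
    # reverse lookup: prefer the pandas extension dtype, fall back to numpy
    try:
        return DatetimeTZDtype.construct_from_string(dtype_str)
    except TypeError:
        return np.dtype(dtype_str)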
Force-pushed from d1f9ed9 to 14c34c1.
Updated.
dtype = kwargs.pop('dtype')
if isinstance(dtype, compat.string_types):
    dtype = DatetimeTZDtype.construct_from_string(dtype)
    values = values.tz_localize('UTC').tz_convert(dtype.tz)

if not isinstance(values, self._holder):
You should do this first; then you can coerce if you have a dtype.
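A sketch of that reordering, rearranging the snippet above (this is a fragment of a block __init__, so self, compat, and DatetimeTZDtype come from the surrounding module; not necessarily the final merged code):

# coerce to the holder first ...
if not isinstance(values, self._holder):
    values = self._holder(values)

# ... then apply the tz-aware dtype if one was passed
dtype = kwargs.pop('dtype', None)
if dtype is not None:
    if isinstance(dtype, compat.string_types):
        dtype = DatetimeTZDtype.construct_from_string(dtype)
    values = values.tz_localize('UTC').tz_convert(dtype.tz)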
Updated... but I forgot to squash :( Will do that later.
@@ -245,6 +246,10 @@ def unconvert(values, dtype, compress=None):
    if dtype == np.object_:
        return np.array(values, dtype=object)

    if isinstance(dtype, string_types):
I get that you have to do this; we prob need something like this (in core/common.py):
def pandas_dtype(dtype):
    if isinstance(dtype, compat.string_types):
        try:
            return DatetimeTZDtype.construct_from_string(dtype)
        except TypeError:
            pass
        try:
            return CategoricalDtype.construct_from_string(dtype)
        except TypeError:
            pass
    dtype = np.dtype(dtype)
    return dtype
Pls add a whatsnew as well (put it in API changes).
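For illustration, such a helper would round-trip dtype strings roughly like this (hypothetical behavior based on the snippet above; pandas did later grow a pandas_dtype helper with similar semantics):

pandas_dtype('datetime64[ns, US/Eastern]')  # DatetimeTZDtype with tz='US/Eastern'
pandas_dtype('category')                    # CategoricalDtype
pandas_dtype('int64')                       # numpy dtype('int64')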
This would close #12142 as well? (We should add a warning in the docs about this, e.g. show the compatibility matrix.)
I can change all the keys to unicode, and then it will. Will update tonight.
Ah, halfway through but I have to go. I will get it done tonight, but is that too late? :(
Sure, no prob.
Updated. Manually tested with Python 2.7 and 3.5 on Linux (zlib and no compression).
@@ -265,28 +269,31 @@ def unconvert(values, dtype, compress=None):
    return np.fromstring(values, dtype=dtype)


def _u(x):
Just use u() from compat/__init__.py (and amend with this).
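A small sketch of what that looks like in practice, assuming the pandas.compat helpers of that era: wrapping packed dict keys with u() gives text (unicode) keys under both Python 2 and Python 3, so a file written by one interpreter decodes with the same keys under the other.

from pandas.compat import u

# keys packed as text compare equal no matter which interpreter wrote the file
block = {u('locs'): [0, 1],
         u('values'): b'\x00' * 8,
         u('dtype'): u('int64')}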
Updated. This is how I tested Python 2 & 3 compatibility: use the script to generate legacy data for 0.18.0 under both Python 2 and Python 3, put the files in the 0.18.0 folder, and let the tests pick them up.
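A minimal hand-rolled sketch of that workflow (not the project's actual generation script; the file name below is just illustrative): run it once under Python 2 and once under Python 3, then drop both outputs into the 0.18.0 legacy-data folder so the round-trip tests pick them up.

import platform
import pandas as pd

# a small frame with a tz-aware column, the case this PR is about
df = pd.DataFrame({'A': [1, 2, 3],
                   'B': pd.date_range('2013-01-01', periods=3,
                                      tz='US/Eastern')})

out = '0.18.0_{0}_{1}.msgpack'.format(platform.system().lower(),
                                      platform.python_version())
pd.to_msgpack(out, {'frame': {'tz_frame': df}})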
Thanks! I added some more test msgpacks for 0.17.1 to cover the bases.
https://travis-ci.org/pydata/pandas/jobs/109620001: I think we need to skip these tests if blosc is not installed. Can you do a PR for this?
closes #10623