Fixed latex output for multi-indexed dataframes - GH9778 #9908

yred · 2015-04-15T18:44:18Z

Proposed fix for #9778

The formatting issue was caused by an incorrect number of elements in the (first) index columns of strcols. The length of reinserted columns was based on the number of elements per index level, but should have relied on the number of rows/occurrences of such elements.
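
A minimal sketch of the mismatch described above, using plain Python lists rather than pandas internals. The names `levels`, `labels`, `run_lengths`, and `column` are illustrative assumptions, not the code in pandas/core/format.py; the idea is only that the column length has to follow the number of rows each value spans, not the number of distinct values in the level.

```python
import itertools

# Hypothetical two-level index with 4 rows: the first level has only 2
# distinct values, but each value occupies 2 consecutive rows.
levels = ["a", "b"]      # distinct values of the first index level
labels = [0, 0, 1, 1]    # per-row positions into `levels`

# Sizing the column by the number of level values gives 2 entries,
# which is too short for a 4-row table.
assert len(levels) == 2

# Counting consecutive occurrences of each label gives the number of
# rows that each value spans.
run_lengths = [(levels[key], len(list(group)))
               for key, group in itertools.groupby(labels)]

# Show each value once, then pad with blanks for the remaining rows.
column = []
for value, count in run_lengths:
    column.extend([value] + [""] * (count - 1))

print(column)  # ['a', '', 'b', '']
```

With this padding the index column has one entry per row, so it lines up with the data columns.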

yred · 2015-04-15T19:47:42Z

pandas/core/format.py

+                lev3 = [blank] * clevels
+                for level_idx, group in itertools.groupby(
+                        self.frame.index.labels[i]):
+                    count = sum(1 for _ in group)


I originally had this as:

count = len(list(group))

but was wondering if using sum() makes a bit more sense from a performance perspective.

On the other hand, there's probably a more elegant way to fix the count mismatch without using itertools.groupby().

Performance shouldn't really be a concern here... nobody is going to output latex for tables much larger than can fit on a single page.

Would it then make more sense to switch it back to how it was? (I guess the first form is a bit more readable/intuitive).

doesn't really matter to me either way :)

Changed it back to the original form (which seems unexpectedly faster from a simple test).

In general, you shouldn't trust benchmarks that show list expressions are faster than generators... allocating memory repeatedly in a benchmark is faster than it is in real use. But again, this is not performance-limited code.
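
For reference, the two counting forms discussed in this thread produce the same counts. The sketch below uses a made-up `labels` list (an assumption, not data from this PR) and only compares results, not speed.

```python
import itertools

labels = [0, 0, 0, 1, 2, 2]  # hypothetical per-row labels for one index level

# Form 1: materialize each group as a list and take its length.
counts_list = [len(list(group)) for _, group in itertools.groupby(labels)]

# Form 2: count the items lazily without building a list.
counts_sum = [sum(1 for _ in group) for _, group in itertools.groupby(labels)]

print(counts_list)  # [3, 1, 2]
print(counts_sum)   # [3, 1, 2]
```

Since both forms agree, the choice between them here comes down to readability.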

shoyer · 2015-04-15T22:01:16Z

This looks great to me. Can you please:

add a note to what's new
squash your changes into one commit (we prefer not to have commits for which Travis tests fail merged into master, because it can complicate things like git bisect)

yred · 2015-04-15T22:39:04Z

@shoyer: Thanks a lot for your feedback.

I've just added a note to the next release, and squashed all the commits. Do let me know if there's anything else that should be added/updated.

shoyer · 2015-04-15T23:06:57Z

This looks good to go once the tests on Travis pass. I'll try to check later, but please feel free to ping me if you notice that's happened.

yred · 2015-04-16T00:32:05Z

@shoyer: I'm not sure how Travis is set up on this repo, but tests are passing on my fork.

hayd · 2015-04-16T00:56:52Z

Do we handle the case where df.index.names is set?

Thanks for fixing this!

yred · 2015-04-16T01:11:45Z

@hayd: That's a good point.

It currently doesn't print out index names. Would that be expected with this fix or should I add it in a separate PR?

shoyer · 2015-04-16T01:16:41Z

At this point probably better to make a separate PR. But I do recall being able to export index names....

yred · 2015-04-16T01:21:55Z

@shoyer: it seems that regression happened a little earlier as well.

I'll be updating the code for that (separately), and potentially cleaning up to_latex() so that things are a bit clearer/cleaner.

hayd · 2015-04-16T01:49:19Z

@yred Fantastic, thanks!

Fixed latex output for multi-indexed dataframes - GH9778

shoyer · 2015-04-16T06:14:44Z

thanks @yred !

yred reviewed Apr 15, 2015
yred force-pushed the fix-mi-to-latex branch from d0fdaba to b2a1211 Compare April 15, 2015 22:07

BUG: Fixed latex output for multi-indexed dataframes - GH9778

4d1268e

yred force-pushed the fix-mi-to-latex branch from b2a1211 to 4d1268e Compare April 15, 2015 22:36

shoyer added a commit that referenced this pull request Apr 16, 2015

Merge pull request #9908 from yred/fix-mi-to-latex

161f38d

Fixed latex output for multi-indexed dataframes - GH9778

shoyer merged commit 161f38d into pandas-dev:master Apr 16, 2015

shoyer mentioned this pull request Apr 16, 2015

Wrong number of row in to_latex output for MultiIndex Dataframe #9778

Closed

jorisvandenbossche mentioned this pull request Jul 23, 2015

BUG: to_latex() output broken when the index has a name #10660

Closed

jreback added this to the 0.16.1 milestone Jul 23, 2015

jreback added the Output-Formatting and MultiIndex labels Jul 23, 2015

jorisvandenbossche mentioned this pull request Apr 16, 2017

to_latex with MI column and index names #8336

Closed
