Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ExcelFile(...).parse(skiprows=...) fails for rows at the end of the sheet #1843

Closed
lodagro opened this issue Sep 5, 2012 · 1 comment
Closed
Labels
Bug IO Data IO issues that don't fit into a more specific label
Milestone

Comments

@lodagro
Copy link
Contributor

lodagro commented Sep 5, 2012

from mailinglist (tried it myself on master also)

I've been using the ExcelFile function to read in a dataset that has some lines of text above and below the numbers I'm interested in.

The option skiprows=[0,1,3,4,5,6,7] works fine to ignore the text above the data (apart from line 2, which contains the labels).

However, if I add values 151, 152, etc., to try to ignore the text lines after the end of the data, this seems to have no effect. Is this behavior deliberate?*

I can get rid of those extra lines at the end by using dropna(), but this seems to have the effect of making the first series of type object rather than float, which is (a little) inconvenient.

@changhiskhan
Copy link
Contributor

Sorry, this was closed pre-maturely. I've enabled skip_footer for ExcelFile.parse and added clarification that skiprows should be for the start of the file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug IO Data IO issues that don't fit into a more specific label
Projects
None yet
Development

No branches or pull requests

2 participants