Docx files being detected as a zip #312
I am getting the following type:
Borewit added the bug label and removed the cannot reproduce label ("The issue described cannot be reproduced") on Jan 20, 2020.
I can confirm this issue occurs with PowerPoint and Excel too. Both are part of the Office 365 suite. Here are some samples:
Borewit added three commits that referenced this issue on Jan 21, 2020.
I am using this library on an AWS EC2 instance. It seems that some docx files are being detected as zip files. I searched through closed issues and found several closed issues and merged PRs on this topic. However, I assume most people aren't using the Windows 10 Office 365 desktop apps.
Here's the version of Word I'm using:
Here is one of the closed issues I referenced above: #103
I haven't tried other document types from the other Office 365 desktop apps (Excel, PowerPoint), but I have a feeling they could have the same issue. I'm going to do some more research and update this post if other file types fail as well.
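For context on why this misdetection can happen at all: docx, xlsx, and pptx files are ZIP containers, so they start with the same signature bytes as any zip archive. Below is a minimal sketch of one way to tell them apart by looking at the entry names inside the archive. It is only an illustration, not how this library does its detection. The function name `classify_zip_container` and the `OOXML_MARKERS` table are hypothetical, and the sketch assumes the main document part sits at its conventional path.

```python
import io
import zipfile

# Assumption: each Office Open XML package keeps its main part at this
# conventional path. Macro-enabled variants (docm, xlsm, ...) share these
# paths and are not distinguished here.
OOXML_MARKERS = {
    "word/document.xml": "docx",
    "xl/workbook.xml": "xlsx",
    "ppt/presentation.xml": "pptx",
}


def classify_zip_container(data: bytes) -> str:
    """Return 'docx', 'xlsx', 'pptx', 'zip', or 'unknown' for raw file bytes."""
    # A non-empty zip archive starts with the local file header signature.
    if not data.startswith(b"PK\x03\x04"):
        return "unknown"
    try:
        with zipfile.ZipFile(io.BytesIO(data)) as archive:
            names = set(archive.namelist())
    except zipfile.BadZipFile:
        return "unknown"
    # Check every entry name, not just the first one, so the result does
    # not depend on the order in which the archive was written.
    for marker, kind in OOXML_MARKERS.items():
        if marker in names:
            return kind
    return "zip"
```

Reading the full entry list means the whole file has to be available, which is a different trade-off from a detector that only inspects the first few bytes of a buffer.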
For now, I'm attaching a sample docx that is being detected as a zip. This document was created using one of the built-in templates on the home page of Word. I just chose one of their Resume templates for testing purposes.
Thank you
SampleResume.zip
UofTCSCoop.docx