Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docx files being detected as a zip #312

Closed
BradleyDHobbs opened this issue Jan 20, 2020 · 3 comments
Closed

Docx files being detected as a zip #312

BradleyDHobbs opened this issue Jan 20, 2020 · 3 comments
Assignees
Labels

Comments

@BradleyDHobbs
Copy link

BradleyDHobbs commented Jan 20, 2020

I am using this library on an AWS ec2 instance. It seems that some docx files are being detected as zip files. I searched through closed issues and there are multiple closed issues and merged PRs about this topic. However, I assume most people aren't using the Windows 10 Office 365 dekstop apps.

Here's the version of Word I'm using:
image

Here is one of the closed issues I referenced above: #103

I haven't tried other document types using other Office 365 desktop apps (Excel, Powerpoint) but I have a feeling it could be the same issue. I'm going to do some more research and update this post if other file types fail as well.

For now, I'm attaching a sample docx that is being detected as a zip. This document was created using built in templates on the home page of word. I just chose one of their Resume templates for testing purposes.

Thank you

SampleResume.zip
UofTCSCoop.docx

@Borewit Borewit self-assigned this Jan 20, 2020
@Borewit Borewit added the cannot reproduce The issue described cannot be reproduced label Jan 20, 2020
@Borewit
Copy link
Collaborator

Borewit commented Jan 20, 2020

I am getting the following type:

MIME-type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
Extension: docx

@Borewit
Copy link
Collaborator

Borewit commented Jan 20, 2020

I gave it another try with the original sample of #103, which gave me an error. Created issue: #313.

@Borewit Borewit added bug and removed cannot reproduce The issue described cannot be reproduced labels Jan 20, 2020
@BradleyDHobbs
Copy link
Author

I can confirm this issue occurs with Powerpoint and Excel too. Both are part of the Office 365 suite.

Here are some samples:

SamplePowerpoint.pptx
SampleExcel.xlsx

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants