Missing UTF-8 BOM leads to codec failures during tests on Windows #175

Open
aogier opened this issue Dec 12, 2022 · 0 comments

aogier commented Dec 12, 2022

FYI, since #174 piqued my curiosity, I noticed that one of the two test files lacks the UTF-8 BOM bytes:

```
$ hd -n4 test/texts/sherlock.txt
00000000  ef bb bf 50                                       |...P|
00000004
$ hd -n4 test/texts/senate-bills.txt
00000000  32 31 73 74                                       |21st|
00000004
```

and this does not seem to please Windows machines (Unix systems probably do a better job of autodetecting the encoding). I don't see this as a problem per se, since getting the input encoding right is probably the user's task, but it lowers the X-Files factor of the previous commits ;)
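For anyone who wants to reproduce the check without `hd`, here is a minimal sketch in Python. The language choice and the helper names `has_utf8_bom` and `read_text` are my own assumptions for illustration, not something taken from the project. It checks for the three BOM bytes and reads a file with an explicit encoding, so the result should not depend on the platform default:

```python
import codecs


def has_utf8_bom(path):
    """Return True if the file starts with the UTF-8 BOM bytes (EF BB BF)."""
    with open(path, "rb") as f:
        return f.read(len(codecs.BOM_UTF8)) == codecs.BOM_UTF8


def read_text(path):
    """Decode the file as UTF-8, dropping a leading BOM if there is one.

    The "utf-8-sig" codec accepts files both with and without a BOM, and
    passing it explicitly avoids falling back to the platform default.
    """
    with open(path, encoding="utf-8-sig") as f:
        return f.read()
```

Going by the dumps above, `has_utf8_bom` should report a BOM for `test/texts/sherlock.txt` and none for `test/texts/senate-bills.txt`, while `read_text` should be able to read either file as long as its content is valid UTF-8.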

thank you, regards
