Skip to content
This repository has been archived by the owner on Dec 15, 2022. It is now read-only.

unicode (multibyte) characters are corrupted #25

Merged
merged 3 commits into from
Jul 9, 2015
Merged

unicode (multibyte) characters are corrupted #25

merged 3 commits into from
Jul 9, 2015

Conversation

aki77
Copy link
Contributor

@aki77 aki77 commented Jun 25, 2015

it's not safe to decode buffers to strings.
they may end part-way through a multi-byte character, hence the issue.

atom/atom#4542

@tomoya
Copy link

tomoya commented Jun 30, 2015

Awesome 🎉

Maybe fixes atom/atom#4268 too.

This is a very important issue. Because the file would have been broken by Atom. I believe it to be resolved soon.

@benogle
Copy link
Contributor

benogle commented Jul 1, 2015

Cool.

We should pass the encoding in from atom. Can you make the encoding an option? We can fallback to utf8.

@benogle benogle self-assigned this Jul 1, 2015
@tomoya
Copy link

tomoya commented Jul 1, 2015

Sounds cool!

benogle added a commit that referenced this pull request Jul 9, 2015
unicode (multibyte) characters are corrupted
@benogle benogle merged commit 7e8d230 into atom:master Jul 9, 2015
@benogle
Copy link
Contributor

benogle commented Jul 9, 2015

Thanks for this. It is better, but there are still a bunch of other encoding issues. I will create another issue outlining them. Encodings are hard.

@aki77
Copy link
Contributor Author

aki77 commented Jul 10, 2015

Thanks!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants