Preserve &nbsp; entity #102
Comments
Is it really a "normal" space character? I mean, does
I think it doesn't (I may be wrong). Even if it did, the main problem remains: when you see a white space, there's no way to tell them apart, so preserving the
Perhaps. I'm not sure if it's possible to "preserve" entities, as jsdom converts all named (and numeric) references to their matching character(s).
It's because So I think it should be changed into
Another problem with not preserving
<i>foo </i>bar
Converts to:
_foo _bar
Because the
In my case the following HTML is incorrectly parsed. The output Markdown doesn't have the trailing space.
Is converted into
I tried to convert
Which is parsed into markdown as
Need to keep the HTML entity space
Just for reference, a thorough analysis of this topic: https://github.com/orchitech/turndown/wiki/Whitespace A PR will come soon. :)
Do not merge ASCII and non-ASCII whitespace. Make sure non-ASCII whitespace is moved out of inline elements to prevent generating broken Markdown. Fix mixmark-io#102. Fix mixmark-io#250.
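To make the idea in this commit message concrete, here is a minimal sketch of moving edge whitespace out of an inline element before the delimiters are added. It is not turndown's actual code: `wrapInline` is a hypothetical helper, and the sketch only assumes that JavaScript's `\s` matches U+00A0 (the character jsdom produces for `&nbsp;`) as well as ASCII whitespace.

```javascript
// Hypothetical helper for illustration only; not part of turndown's API.
// Splits leading/trailing whitespace off the content and emits it OUTSIDE
// the delimiters, leaving every whitespace character exactly as it was.
function wrapInline(content, delimiter) {
  // \s covers ASCII whitespace and non-ASCII whitespace such as U+00A0.
  const [, leading, core, trailing] = /^(\s*)([\s\S]*?)(\s*)$/.exec(content);

  // Whitespace-only content: nothing to emphasize, so return it unchanged.
  if (core === '') return leading + trailing;

  return leading + delimiter + core + delimiter + trailing;
}

// The <i>foo&nbsp;</i>bar case: the non-breaking space ends up after the
// closing delimiter instead of inside it, and it is not turned into ' '.
const fixed = wrapInline('foo\u00A0', '_') + 'bar';
console.log(JSON.stringify(fixed));
```

The point of keeping the captured groups verbatim is that ASCII and non-ASCII whitespace are never merged or swapped; the helper only changes where the delimiters sit.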
The non-breaking space entity (&nbsp;) should not be replaced by a normal space character. More generally, it may be useful to let the user decide how HTML entities should be dealt with (converted or preserved).
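As a rough sketch of what "let the user decide" could look like, the function below post-processes converted text according to a policy. The function name `applyNbspPolicy` and the policy names are assumptions made up for this illustration; turndown does not expose such an option as far as this thread shows.

```javascript
// Hypothetical post-processing step; the name and the policy values are
// illustrative assumptions, not an existing turndown option.
function applyNbspPolicy(markdown, policy = 'entity') {
  switch (policy) {
    case 'entity':
      // Preserve: write the non-breaking space back as a visible entity.
      return markdown.replace(/\u00A0/g, '&nbsp;');
    case 'space':
      // Convert: collapse it to an ordinary ASCII space.
      return markdown.replace(/\u00A0/g, ' ');
    case 'keep':
      // Leave the raw U+00A0 character in the output.
      return markdown;
    default:
      throw new RangeError(`Unknown nbsp policy: ${policy}`);
  }
}

console.log(applyNbspPolicy('foo\u00A0bar', 'entity')); // prints "foo&nbsp;bar"
```

With the 'entity' policy the difference stays visible in the Markdown source, which addresses the "no way to tell them apart" concern raised in the comments.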