Invalid html generated when parsing link #130

rchl · 2022-06-18T18:49:27Z

Python 3.3>>> mdpopups.show_popup(view, 'and [`i32`](`i32`) is')
Parse Error: ">i32</span>"><code class="highlight"><span style="color: #d8dee9;">i32</span></code></a> is</p></div> code: Unexpected character

The generated HTML is:

and <a href="<span style="color: #d8dee9;">i32</span>"><code class="highlight"><span style="color: #d8dee9;">i32</span></code></a> is

Which is invalid since the content of the href attribute is not escaped.

The escaping would make the code valid but I believe that what should happen is nothing in the link should be transformed into HTML and instead it should be taken literally, and then also escaped if needed.

This is how Github renders it:

and i32 is

and <a href="%60i32%60"><code class="notranslate">i32</code></a> is

The text was updated successfully, but these errors were encountered:

facelessuser · 2022-06-18T18:56:39Z

Yeah, I'm actually aware of this, and I'll have a fix for the next version. There was some complaint at some point about the fact that when Sublime was processing callbacks in HTML links that escaped quotes weren't handled properly by Sublime. A change was made to not escape them, which is simply a bad idea that was rarely encountered. We just need to let them be escaped.

Anyways, this is an issue in mdpopups, not any of the underlying libraries.

rchl · 2022-06-18T19:06:35Z

And the fixed behavior will also not convert markdown to html in links?

facelessuser · 2022-06-18T19:10:38Z

And the fixed behavior will also not covert markdown to html in links?

I'm not sure I understand the question. We are talking about how proper escaping in HTML attributes right?

What do you mean by your question, can you clarify with an example?

rchl · 2022-06-18T19:13:48Z

The fixed logic could result in either:

<a href="%60i32%60">
or
<a href="<span style="color: #d8dee9;">i32</span>">

I believe that the former one is correct since markdown should not be processed in URIs.

facelessuser · 2022-06-18T19:21:38Z

Can you give me a reproducible Markdown source? I see the expected outputs, but I'm still not sure what you are providing as source.

facelessuser · 2022-06-18T19:23:14Z

Wait, are you trying to provide Markdown code syntax as a link? If so, you aren't getting that to convert properly. You need to provide a link or properly escaped content for reference.

facelessuser · 2022-06-18T19:24:15Z

You'll see all sorts of different outputs with different Markdown parsers. Some might handle it more sane than others, but you should not be doing this: https://johnmacfarlane.net/babelmark2/?text=%5B%60i32%60%5D(%60i32%60)

facelessuser · 2022-06-18T19:25:36Z

I honestly thought you were talking about something else.

rchl · 2022-06-18T19:26:30Z

It's the same example I gave in the initial comment:

'and [`i32`](`i32`) is'

It contains backticks in the link's URI and it comes from the LSP server.

I've talked about it in the initial comment and also shown how Github's parser handles it.

rchl · 2022-06-18T19:31:45Z

Just escaping the link will fix the main issue that causes parsing error so it might be enough to do that.

Since a link like the one provided here as an example will likely not do anything useful, it might as well contain escaped HTML but it feels like the ultimate fix would be to not process markdown in links, besides escaping.

facelessuser · 2022-06-18T20:20:28Z

It's the same example I gave in the initial comment:

Yes, I was on my phone, but I see that now. Basically, code is handled before URLs, which is why this happens. That is just the sequencing for how Python Markdown does things.

I've talked about it in the initial comment and also shown how Github's parser handles it.

Yep, how GitHub handles this is irrelevant. Currently, we are using the Python Markdown parser.

I've mentioned at one time that I'm potentially willing to add support for a CommonMark parser as soon as Package Control supports Py38 dependencies, though it's starting to feel like they may never happen...

Anyways, if you format the content in an escaped manner, it'll probably go through fine. I don't plan on patching the underlying library to workaround odd cases that it was not designed to handle. One could argue this is a bug, but it would be an upstream bug with Python Markdown, but as I'm familiar with its architecture, I'm doubtful a fix will be coming.

facelessuser · 2022-06-19T03:37:38Z

@rchl Hmm, there may be an issue with how we do things using the Sublime highlighter. Turning off the Sublime highlighter (which causes Pygments to be used instead) doesn't seem to exhibit this issue.

I wanted to do my due diligence before closing this issue, so there may actually be something for us to fix here. I'll try to dig deeper.

facelessuser · 2022-06-19T03:58:05Z

Hmm, it just appears that when Pygments handles a plain text code, it simply returns the text and styles it with CSS, but with the Sublime highlighter, we actually add inline styles, which is why we get HTML content. Really, Markdown should not be used as a URL.

I'm not sure if there is anything we can reasonably do, but I'll take a look if there is something simple we can do without trying to monkey patch Markdown itself.

facelessuser · 2022-06-19T04:03:41Z

Yeah, we can break pygments with [`123`](`#!py3 123`) which will then perform highlighting which causes spans to be written:

<p><a href="<span class="mi">123</span>
"><code class="highlight">123
</code></a></p>

gir-bot added the S: triage Issue needs triage. label Jun 18, 2022

facelessuser added T: bug Bug. S: confirmed Confirmed bug report or approved feature request. and removed S: triage Issue needs triage. labels Jun 18, 2022

jwortmann mentioned this issue Aug 9, 2022

Popup text is truncated at & in Rust source sublimelsp/LSP#2012

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Invalid html generated when parsing link #130

Invalid html generated when parsing link #130

rchl commented Jun 18, 2022

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022 •

edited

Loading

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022 •

edited

Loading

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022

rchl commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 19, 2022

facelessuser commented Jun 19, 2022

facelessuser commented Jun 19, 2022 •

edited

Loading

Invalid html generated when parsing link #130

Invalid html generated when parsing link #130

Comments

rchl commented Jun 18, 2022

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022 • edited Loading

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022 • edited Loading

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 18, 2022

rchl commented Jun 18, 2022

rchl commented Jun 18, 2022

facelessuser commented Jun 18, 2022

facelessuser commented Jun 19, 2022

facelessuser commented Jun 19, 2022

facelessuser commented Jun 19, 2022 • edited Loading

rchl commented Jun 18, 2022 •

edited

Loading

rchl commented Jun 18, 2022 •

edited

Loading

facelessuser commented Jun 19, 2022 •

edited

Loading