Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What to do if line have "<" and ">" in text? #7

Open
Gh0sTG0 opened this issue Nov 27, 2023 · 0 comments
Open

What to do if line have "<" and ">" in text? #7

Gh0sTG0 opened this issue Nov 27, 2023 · 0 comments

Comments

@Gh0sTG0
Copy link

Gh0sTG0 commented Nov 27, 2023

Hi.
This is crashing with "not well-formed (invalid token): line 1, column 67".
When I checked what is in inner html that I'm trying to parse, I find, that problems are in this line (yes, I modified it a bit, so it becomes user-friendly):

<h1 class="the_class"><a href="/the_link.html" title="The title is <So important>">And some text here</a></h1>
-----------------------------------------------------------------^

And error links us to where "is" ends.
If I'm correct, fail is because there, on page, someone named title with "<" and ">", so it triggers on it as on new tag and wants me to fix things that there should be something like:

...="The title is" <So

Or something like that.

Is there some way to make it think that anyway there should not be any tags inside "the quotes", even if there are some "<" and ">" inside?

PS At the moment I can just remove title part, but... how soon I will find that "<" and ">" are in part that I need?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant