-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error with parsing when HTML tags uppercased #26
Comments
@miso-belica thanks. We'll take a look into this. |
I think I've found the problem. I'm not yet sure of the fix though. In the tokenizer when processing raw text you have:
At this point |
…nd normalizing tag names to lowercase (per 8.2.4.9) except for SVG foreign tags that are case sensitive.
@miso-belica I think this fixes it. Please let me know if uppercase tags are still a problem. |
I think it's OK now. Thanks 👍 |
Hi,
I discovered some weird behavior at this page http://rayer.g6.cz/. I also pasted source HTML here http://pastebin.com/FQjSEGCK .
Everything from the text in
html > head > title
is escaped (even</TITLE>
tag). I find out that if I use functionstrtolower
like this\HTML5::loadHTML(strtolower($html))
HTML is parsed correctly. Can you look at this please?Thank you for your work - I can parse HTML also in PHP finally :)
The text was updated successfully, but these errors were encountered: