An extremely fast web scraper that parses megabytes of invalid HTML in the blink of an eye. PHP 5.3+, no dependencies.
Updated Oct 12, 2024 - PHP
A broken-HTML parser written in TypeScript, intended to run in browsers as a single bundled JS file. Each branch supports specific broken web pages.