Releases: html-extract/hext
Hext v1.0.12
Install the htmlext command-line utility and Hext for Python (v3.13 or earlier):
pip install hext
Install Hext for Node (v23 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
Install Hext for JavaScript (Node/Browser on any platform):
npm install hext.js
For other installation methods please see the download page for Hext.
Changes
- Fix for #39: Python, Node: Symbols of linked static libraries (such as libstdc++) are no longer exported. This fixes an issue with libraries that do the same, such as "scipy.spatial". Thank you @impredicative for reporting this issue.
Hext v1.0.11
Install the htmlext command-line utility and Hext for Python (v3.13 or earlier):
pip install hext
Install Hext for Node (v23 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
Install Hext for JavaScript (Node/Browser on any platform):
npm install hext.js
For other installation methods please see the download page for Hext.
Changes
- Added support for Node v23, dropped Node v21
- Added support for Python v3.13, dropped v3.8
- Python wheels for Linux now require at least platform tag
manylinux_2_28
- Updated boost from 1.85.0 to 1.86.0
Hext v1.0.9
Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):
pip install hext
Install Hext for Node (v22 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
Install Hext for JavaScript (Node/Browser on any platform):
npm install hext.js
For other installation methods please see the download page for Hext.
Changes
- Hext: Added a new trait
:type-matches(regex)
that selects elements with a tag name that matches the given regex. Originally suggested by @impredicative in #30. - Python: Improve error messages for argument type mismatch ( #28, #27 )
- Added release for Node v22
- Build with Boost.regex header-only, if available
- Updated boost from 1.83.0 to 1.85.0
Hext v1.0.8
Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):
pip install hext
Install Hext for Node (v21 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).
Install Hext for JavaScript (Node/Browser on any platform):
npm install hext.js
For other installation methods please see the download page for Hext.
Changes
- New releases for macOS on ARM64 built with the new M1 Github Runner.
- Node, Linux: Fixed accidental dependency on libgcc and libstdc++.
- Node, Linux: Fixed accidental dependency on GLIBC2.34. Now only GLIBC2.14 is required, as was previously.
Hext v1.0.6
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):
pip install hext
Install Hext for Node (v21 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.6
See html-extract/hext.js for more.
Changes
- Added release for Node v21 and Python v3.12
- Dropped support for Node v19 and v16
- Dropped support for Python v3.7
- Updated boost from 1.82.0 to 1.83.0
- Improved compatibility with libgumbo forks (specifically the libgumbo fork that is used in Arch) 07d08ce
Hext v1.0.5
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):
pip install hext
Install Hext for Node (v20 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.5
See html-extract/hext.js for more.
Changes
- Added release for node v20
- Dropped support for node v14
- Updated boost from 1.80.0 to 1.82.0
- Removed hext-ruby and hext-php from continuous integration
Hext v1.0.4
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):
pip install hext
Install Hext for Node (v19 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).
Hext for JavaScript
Hext.js v1.0.4
See html-extract/hext.js for more.
Changes
- Added release for node v19, python v3.11
- Dropped support for node v17, python v3.6
- Updated boost from 1.79.0 to 1.80.0
Hext v1.0.3
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v18 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for JavaScript
Hext.js v1.0.3
See html-extract/hext.js for more.
Changes
- Added release for node v18
- Dropped support for node v12
- Updated boost from 1.78.0 to 1.79.0
Hext v1.0.2
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v17 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for JavaScript
Hext.js v1.0.2
See html-extract/hext.js for more.
Changes
- Hext for Node: rewrite with node-addon-api ( 9aea455 )
- Hext for JavaScript: Improved API ( html-extract/hext.js#6 )
- Hext website: Improved download page
- libhext: Fix warnings on newer compilers
Hext v1.0.1
Static binary releases
Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):
pip install hext
Install Hext for Node (v17 or earlier):
npm install hext
Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.
Hext for WebAssembly
Hext Emscripten v1.0.1
See html-extract/hext-emscripten for more.
Changes
- Language bindings:
rule.extract
now supports the optional parametermax_searches
, which is of type unsigned int. The search for matching elements is aborted after this limit is reached. The default is 0, which never aborts. If running untrusted hext templates, I recommend settingmax_searches
to some high value, like 10000, to protect against resource exhaustion. - Consistent versions: htmlext, libhext and pypi now share the same version number, enforced by CI. The NPM package has the same version but incremented by 10.
- Refactored build and CI scripts