Releases: html-extract/hext

Hext v1.0.12

03 Nov 10:16

Install the htmlext command-line utility and Hext for Python (v3.13 or earlier):

pip install hext

Install Hext for Node (v23 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).

Install Hext for JavaScript (Node/Browser on any platform):

npm install hext.js

For other installation methods please see the download page for Hext.

Changes

  • Fix for #39: Python, Node: Symbols of linked static libraries (such as libstdc++) are no longer exported. This fixes an issue with other libraries that also export these symbols, such as "scipy.spatial". Thank you @impredicative for reporting this issue.

Hext v1.0.11

17 Oct 18:12

Install the htmlext command-line utility and Hext for Python (v3.13 or earlier):

pip install hext

Install Hext for Node (v23 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).

Install Hext for JavaScript (Node/Browser on any platform):

npm install hext.js

For other installation methods please see the download page for Hext.

Changes

  • Added support for Node v23, dropped Node v21
  • Added support for Python v3.13, dropped v3.8
  • Python wheels for Linux now require at least platform tag manylinux_2_28
  • Updated boost from 1.85.0 to 1.86.0

Hext v1.0.9

25 Apr 12:58

Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):

pip install hext

Install Hext for Node (v22 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).

Install Hext for JavaScript (Node/Browser on any platform):

npm install hext.js

For other installation methods please see the download page for Hext.

Changes

  • Hext: Added a new trait :type-matches(regex) that selects elements with a tag name that matches the given regex. Originally suggested by @impredicative in #30.
  • Python: Improved error messages for argument type mismatches (#28, #27)
  • Added release for Node v22
  • Build with Boost.regex header-only, if available
  • Updated boost from 1.83.0 to 1.85.0
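
To illustrate the idea behind selecting elements by a regex on their tag name, here is a minimal sketch using only the Python standard library. It does not use Hext or its template syntax; `TagNameCollector` and `tags_matching` are hypothetical names made up for this example, and the use of an unanchored `re.search` is an assumption, since the release note does not say how the regex is applied.

```python
import re
from html.parser import HTMLParser


class TagNameCollector(HTMLParser):
    """Hypothetical helper (not part of Hext): records start tags whose name matches a regex."""

    def __init__(self, pattern):
        super().__init__()
        self.pattern = re.compile(pattern)
        self.matched = []

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names in lowercase.
        # Assumption: the regex is searched, not full-matched, so anchor it if needed.
        if self.pattern.search(tag):
            self.matched.append(tag)


def tags_matching(html, pattern):
    """Return the names of all start tags in `html` whose name matches `pattern`."""
    collector = TagNameCollector(pattern)
    collector.feed(html)
    collector.close()
    return collector.matched


if __name__ == "__main__":
    # Select every heading level with a single pattern.
    print(tags_matching("<h1>a</h1><p>b</p><h3>c</h3>", r"^h[1-6]$"))
```

A pattern such as `^h[1-6]$` covers all heading levels at once, which is the kind of case a tag-name regex is convenient for.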

Hext v1.0.8

09 Nov 15:58

Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):

pip install hext

Install Hext for Node (v21 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and macOS (≥10.11 for x86_64, ≥11.0 for ARM64/M1/M2).

Install Hext for JavaScript (Node/Browser on any platform):

npm install hext.js

For other installation methods please see the download page for Hext.

Changes

  • New releases for macOS on ARM64, built with the new M1 GitHub runner.
  • Node, Linux: Fixed accidental dependency on libgcc and libstdc++.
  • Node, Linux: Fixed accidental dependency on GLIBC 2.34. Only GLIBC 2.14 is required, as before.

Hext v1.0.6

17 Oct 22:40

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.12 or earlier):

pip install hext

Install Hext for Node (v21 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.6
See html-extract/hext.js for more.

Changes

  • Added release for Node v21 and Python v3.12
  • Dropped support for Node v19 and v16
  • Dropped support for Python v3.7
  • Updated boost from 1.82.0 to 1.83.0
  • Improved compatibility with libgumbo forks (specifically the libgumbo fork that is used in Arch) 07d08ce

Hext v1.0.5

21 Apr 16:05

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):

pip install hext

Install Hext for Node (v20 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.5
See html-extract/hext.js for more.

Changes

  • Added release for Node v20
  • Dropped support for Node v14
  • Updated boost from 1.80.0 to 1.82.0
  • Removed hext-ruby and hext-php from continuous integration

Hext v1.0.4

29 Oct 09:00

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.11 or earlier):

pip install hext

Install Hext for Node (v19 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11 (x86_64).

Hext for JavaScript

Hext.js v1.0.4
See html-extract/hext.js for more.

Changes

  • Added release for Node v19 and Python v3.11
  • Dropped support for Node v17 and Python v3.6
  • Updated boost from 1.79.0 to 1.80.0

Hext v1.0.3

22 Apr 13:36

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v18 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for JavaScript

Hext.js v1.0.3
See html-extract/hext.js for more.

Changes

  • Added release for Node v18
  • Dropped support for Node v12
  • Updated boost from 1.78.0 to 1.79.0

Hext v1.0.2

09 Jan 18:17

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v17 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for JavaScript

Hext.js v1.0.2
See html-extract/hext.js for more.

Changes

Hext v1.0.1

17 Dec 17:19

Static binary releases

Install the htmlext command-line utility and Hext for Python (v3.10 or earlier):

pip install hext

Install Hext for Node (v17 or earlier):

npm install hext

Both are compatible with Linux (x86_64) and Mac OS X ≥ 10.11.

Hext for WebAssembly

Hext Emscripten v1.0.1
See html-extract/hext-emscripten for more.

Changes

  • Language bindings: rule.extract now supports the optional parameter max_searches, which is of type unsigned int. The search for matching elements is aborted after this limit is reached. The default is 0, which never aborts. If running untrusted hext templates, I recommend setting max_searches to some high value, like 10000, to protect against resource exhaustion.
  • Consistent versions: htmlext, libhext and the PyPI package now share the same version number, enforced by CI. The npm package has the same version but incremented by 10.
  • Refactored build and CI scripts
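
The effect of a search limit like max_searches can be sketched in plain Python. This is only an illustration of the idea (count each visited element, abort once the limit is exceeded, treat 0 as "no limit"); it is not Hext's implementation. `find_matches`, `SearchLimitExceeded` and the nested-dict tree are hypothetical, and signalling the abort with an exception is an assumption of this sketch.

```python
class SearchLimitExceeded(Exception):
    """Hypothetical error type for this sketch: raised when the search budget is used up."""


def find_matches(root, predicate, max_searches=0):
    """Depth-first search over a tree of nested dicts.

    Every visited node counts as one search. If max_searches is non-zero and
    the count exceeds it, the search is aborted. A value of 0 never aborts.
    """
    matches = []
    searches = 0
    stack = [root]
    while stack:
        node = stack.pop()
        searches += 1
        if max_searches and searches > max_searches:
            raise SearchLimitExceeded(f"aborted after {max_searches} searches")
        if predicate(node):
            matches.append(node)
        # Push children in reverse so they are visited in document order.
        stack.extend(reversed(node.get("children", [])))
    return matches


if __name__ == "__main__":
    tree = {"tag": "div", "children": [
        {"tag": "a", "children": []},
        {"tag": "p", "children": [{"tag": "a", "children": []}]},
    ]}
    links = find_matches(tree, lambda n: n["tag"] == "a", max_searches=10000)
    print(len(links))
```

A fixed budget bounds the work a single template can trigger regardless of how the template or the input is shaped, which is why a high but finite value is a reasonable guard for untrusted input.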