Go Port of Dom Distiller #25

Open
markusmobius opened this issue Nov 4, 2020 · 0 comments
For those interested in an updated version of Boilerpipe: the Chromium team based DOM Distiller, the library behind Chrome's reader mode, on Boilerpipe. Like Boilerpipe, it is written in Java, and its layout closely follows the Boilerpipe repo (similar file structure, etc.). However, the library has GWT dependencies and is meant to be compiled to JavaScript.

We ported the Java code of DOM Distiller to Go (without GWT and any Chromium dependencies):

https://github.com/markusmobius/go-domdistiller

It now works as a server-side or command-line program, similar to the original Boilerpipe.

The stable branch is the most faithful port of Distiller (everything except a few parts where Distiller relies on render-level information).

The master branch also incorporates insights from Mozilla's Readability.
