NBoilerpipe is a C# port of the boilerpipe 1.2 library (http://code.google.com/p/boilerpipe/). Most of the code was converted with the Sharpen tool (https://github.com/slluis/sharpen). The code uses the Sharpen library (with modifications) from the NGit project (https://github.com/slluis/ngit) and HtmlAgilityPack (http://htmlagilitypack.codeplex.com/).

NBoilerpipe has only been tested with Mono.

Usage:

    using NBoilerpipe.Extractors;
    ...
    String html = GetHtmlText();
    var text = ArticleExtractor.INSTANCE.GetText (html);
    //var text = DefaultExtractor.INSTANCE.GetText (html);
    ...