Skip to content
iterationlabs edited this page Sep 13, 2010 · 11 revisions

Overview

Parsley is a simple language for data-extraction from XML-like documents (including HTML). Parsley is:

  1. Blazing fast — Typical HTML parses are sub-50ms.
  2. Easy to write and understand — Parsley uses your current knowledge of JSON, CSS, and XPath.
  3. Powerful. Parsley can understand full CSS and XPath, including standard and user-defined functions.

Parselets are data extractors written in Parsley. Here is an example parselet:

{
 "title": "h1",
 "links(a)": [{
    "text": ".",
    "link": "@href"
  }]
}

See more examples in the Parsley Examples section.

Wiki Sections

Tutorial

Parsley Functions

Parsley Examples

Language Bindings: Ruby, Python

Resources

Clone this wiki locally