A good start point of resources for make scraping.
- JamAPI – Parse web pages using CSS query selectors.
- wrapAPI – Build an API on top of any website.
- TwitRSS.me – Get your twitter user feed as RSS.
- YQL – Yahoo Query Language.
- cheerio – Fast, flexible, and lean implementation of core jQuery designed specifically for the server.
- jsdom – A JavaScript implementation of the WHATWG DOM and HTML standards, for use with Node.js.
- parse5 – WHATWG HTML5 specification-compliant, fast and ready for production HTML parsing/serialization toolset for Node.js.
- scrape-it – A Node.js scraper for humans.