Skip to content
This repository has been archived by the owner on Sep 14, 2024. It is now read-only.
/ href-scraper Public archive

ruby CLI to scrape hrefs from single webpages

License

Notifications You must be signed in to change notification settings

circa10a/href-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

href-scraper

Gem Version Gem

A ruby gem to get all the hrefs from a single URL(export as json)

Installation

Add this line to your application's Gemfile:

gem 'href_scraper'

And then execute:

$ bundle

Or install it yourself as:

$ gem install href_scraper

Usage

❯ href_scraper --url https://github.com
[
    "https://github.com/",
    "https://help.github.com/terms",
    "https://help.github.com/privacy",
    "https://enterprise.github.com/trial",
    "https://enterprise.github.com/contact",
    "https://help.github.com/terms",
    "https://help.github.com/privacy",
    "https://atom.io",
    "http://electron.atom.io/",
    "https://desktop.github.com/",
    "https://developer.github.com",
    "https://education.github.com/",
    "https://partner.github.com/",
    "https://community.github.com/",
    "https://github.com/about",
    "https://blog.github.com",
    "https://shop.github.com",
    "https://github.com/contact",
    "https://git.luolix.topmunity",
    "https://help.github.com",
    "https://status.github.com/",
    "https://help.github.com/articles/github-security/",
    "https://services.github.com/"
]

Releases

No releases published

Packages

No packages published