Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 530 Bytes

README.md

File metadata and controls

17 lines (13 loc) · 530 Bytes

Web crawl and search

A shell script to crawl a website and return a list of pages that do not contain the specified string

usage: ./crawl.sh base_url search_scope search_string report_on
  base_url         the base protocal and domain (e.g. http://example.com)
  search_scope     one of internal, external or all
  search_string    the string that should exist on all pages
  report_on        one of exists or not_exists

Example usage

./crawl.sh https://microsoft.com internal windows not_exists