Link Checker is a Python Scrapy project to check link status.

shivamsupr/link-checker

Link Checker

Link Checker is a crawler, written with the Python Scrapy framework, that checks the status of links.

Dependencies

Install the following dependencies with a Python package manager such as pip or pipenv.

Python >=3.6
pipenv

urllib3
scrapy >=1.5

Activate env

pipenv shell

Run

scrapy crawl link_checker_spider -a manifest_url=https://www.sangam.com/asset-manifest.json -a input_url=https://www.sangam.com -a fetch_from_file=true -a only_broken=true -a filepath=sangam-domains.json -o items.json
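-o: output file for the scraped items (items.json in the command above)
-a: spider argument, passed as name=value (repeat -a for each argument)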

# Command-line Arguments:
manifest_url
input_url
only_broken
fetch_from_file
filepath (required only when fetch_from_file=true)
Note: add your own domain list JSON file to the shared directory and pass its name in filepath.
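
Scrapy hands every -a value to the spider as a string, so flags such as only_broken=true arrive as the text "true" rather than as a boolean. The sketch below shows one way the arguments listed above could be converted and validated, including the rule that filepath is needed only when fetch_from_file=true. It is an illustration, not the project's code: the names parse_bool, CrawlArgs, and build_args are hypothetical, and treating manifest_url and input_url as required, with both flags defaulting to false, is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


def parse_bool(value: str) -> bool:
    """Interpret a string flag such as "true" or "false" (case-insensitive)."""
    return value.strip().lower() == "true"


@dataclass
class CrawlArgs:
    """Hypothetical container for the arguments listed above (not from the project)."""
    manifest_url: str
    input_url: str
    only_broken: bool = False      # assumed default
    fetch_from_file: bool = False  # assumed default
    filepath: Optional[str] = None


def build_args(**raw: str) -> CrawlArgs:
    """Convert raw string arguments into typed values and validate them."""
    args = CrawlArgs(
        manifest_url=raw["manifest_url"],  # assumed to be required
        input_url=raw["input_url"],        # assumed to be required
        only_broken=parse_bool(raw.get("only_broken", "false")),
        fetch_from_file=parse_bool(raw.get("fetch_from_file", "false")),
        filepath=raw.get("filepath"),
    )
    # filepath is required only when fetch_from_file=true
    if args.fetch_from_file and not args.filepath:
        raise ValueError("filepath is required when fetch_from_file=true")
    return args
```

Calling build_args with the values from the Run command yields only_broken and fetch_from_file as real booleans, while omitting filepath with fetch_from_file="true" raises a ValueError before any crawling would start.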

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT
