Scraper

Scraper is a terminal web-scraper, which does content from url more readable.

Scraper gets html content from url
Clears it from tags, unnecessary garbage and so on
Formats the content, according set format (currently formatting supported only for links)
Saves the content to file with path, which gotten from url (for instance, https://microsoft.com/ -> ".\microsoft.com\index.txt")

You can also:

Scraper uses the follow libraries:

To run scraper using terminal, it is necessary to have Python with 3.7+ version. When it will be installed, you can run scraper.

Sample: python main.py --url "https://habr.com/ru/post/446816/" or python main.py -u "https://habr.com/ru/post/446816/"

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
common.py		common.py
config.json		config.json
extractor.py		extractor.py
formatter.py		formatter.py
main.py		main.py
saver.py		saver.py
scraper.py		scraper.py

Provide feedback