Web privacy

Summary of web privacy, based on webbkoll.

Using

On any platform: pass a space-separated list of URLs to the summary spider through the urls argument:

scrapy crawl summary -a urls="<URL_1 URL_2 ...>"

Linux

Pass a list of URLs to run.sh, e.g.:

run.sh `cat urls.txt`

Windows

Just run run.bat.

Before that, you have to set up a Python virtual environment, install the required Python packages, and edit urls.txt by hand.

Use run.sh as an example.
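If you prefer one runner for both platforms, the same steps can be sketched in Python. This is only an illustration, not part of the repository: the function names read_urls, build_command and run are hypothetical, the contents of run.sh and run.bat are not shown here, and the sketch assumes that urls.txt holds whitespace-separated URLs and that scrapy is available in the active virtual environment.

```python
import subprocess
from pathlib import Path


def read_urls(path="urls.txt"):
    """Split the file on whitespace, much as `cat urls.txt` does in a shell."""
    return Path(path).read_text(encoding="utf-8").split()


def build_command(urls, nolog=True):
    """Build the `scrapy crawl summary` invocation for the given URLs."""
    # The spider receives all URLs as one space-separated `urls` argument.
    command = ["scrapy", "crawl", "summary", "-a", "urls=" + " ".join(urls)]
    if nolog:
        # Pass nolog=False to see Scrapy's log output while debugging.
        command.append("--nolog")
    return command


def run(path="urls.txt", nolog=True):
    """Read the URL list and start the crawl; returns scrapy's exit code."""
    return subprocess.call(build_command(read_urls(path), nolog=nolog))
```

Calling run() from the project directory starts the crawl for every URL in urls.txt; run(nolog=False) does the same with logging enabled.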

Note: to debug, remove --nolog from the runner scripts (run.sh, run.bat) so that Scrapy prints its log.