TheScrapper is a versatile web scraping tool designed to extract emails, phone numbers, and social media accounts from websites. You can use the gathered information for various purposes, such as further research or contacting the website's owners.
To get started with TheScrapper, follow these simple installation steps:
- Clone the repository:
git clone https://github.com/champmq/TheScrapper.git
- Change the directory:
cd TheScrapper
- Install all the requirements:
pip3 install -r requirements.txt
TheScrapper offers several usage options:
- Simple scan:
python3 TheScrapper.py --url URL
- Scan and crawl found URLs:
python3 TheScrapper.py --url URL --crawl
- Retrieve more information about found social media accounts:
python3 TheScrapper.py --url URL -s
For additional command-line arguments and options, refer to the help menu:
python3 TheScrapper.py -h
To remove the banner, simply add the "-b" flag.
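The invocations above can also be assembled from a script. The following is a minimal sketch, not part of TheScrapper itself: the helper names `build_command` and `run_scan` and their parameters are hypothetical, and only the flags listed above (--url, --crawl, -s, -b) are used. It assumes the script is run from inside the TheScrapper directory.

```python
import subprocess


def build_command(url, crawl=False, socials=False, no_banner=False):
    """Return the argument list for one TheScrapper run (hypothetical helper)."""
    cmd = ["python3", "TheScrapper.py", "--url", url]
    if crawl:
        cmd.append("--crawl")  # also crawl the URLs found on the page
    if socials:
        cmd.append("-s")       # retrieve more information about found social accounts
    if no_banner:
        cmd.append("-b")       # remove the banner
    return cmd


def run_scan(url, **options):
    """Run a scan and return whatever the tool printed to standard output."""
    result = subprocess.run(
        build_command(url, **options),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the tool exits with an error
    )
    return result.stdout
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when the URL contains characters such as `&` or `?`.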
To add more social media sites for scraping, append them to the socials.txt file. Feel free to contribute by submitting a pull request if you'd like to share your additions with the community.
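Appending an entry can be scripted. This is a minimal sketch that assumes socials.txt holds one site per line; that format is an assumption, so check the file in the repository first. The function name `add_social` is hypothetical.

```python
from pathlib import Path


def add_social(site, path="socials.txt"):
    """Append site to the list unless it is already present. Returns True if added."""
    file = Path(path)
    text = file.read_text(encoding="utf-8") if file.exists() else ""
    # Assumption: one entry per line; comparison ignores case and blank lines.
    existing = {line.strip().lower() for line in text.splitlines() if line.strip()}
    site = site.strip()
    if not site or site.lower() in existing:
        return False
    # Make sure the new entry starts on its own line.
    prefix = "" if not text or text.endswith("\n") else "\n"
    with file.open("a", encoding="utf-8") as handle:
        handle.write(f"{prefix}{site}\n")
    return True
```

Skipping entries that are already listed keeps the file free of duplicates, which also makes a pull request easier to review.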
When you scan a website that is already included in the socials.txt file, the --sm flag may produce less useful output. To avoid this, consider excluding such URLs or not using the --sm flag.
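One way to spot such URLs before a scan is to compare the target's host against the entries in socials.txt. The sketch below is an illustration under stated assumptions, not TheScrapper's own logic: it assumes the file lists bare domain names (for example `twitter.com`), one per line, and the function names `load_socials` and `is_listed_social` are hypothetical.

```python
from urllib.parse import urlparse


def load_socials(path="socials.txt"):
    """Read the non-empty lines of the socials file (assumed one domain per line)."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def is_listed_social(url, socials):
    """True if the URL's host equals a listed site or is a subdomain of it."""
    host = (urlparse(url).hostname or "").lower()
    for entry in socials:
        entry = entry.strip().lower()
        if entry and (host == entry or host.endswith("." + entry)):
            return True
    return False
```

If `is_listed_social(target, load_socials())` returns True, skip that URL or leave the flag off for that run. The URL must include a scheme such as `https://`; without one, `urlparse` does not report a host and the check returns False.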