YellowPage-scraper

Welcome to the Yellowpage Webscraper using Python Playwright! This repository contains the code for a web scraper that can extract information from yellow pages websites. The scraper uses the Python Playwright library to automate the process of browsing and extracting data from the website. To get started, you will need to have Python and and the necessary requirements installed on your machine. You can install Playwright by running the following command:

pip install -r requirements.txt
playwright install

The repository includes the following files:

scraper.py: This is the main script that initiate the automation. tools.py: This file contains the main code for the scrapera. output.xlsx: This file will be created by the script and will contain the extracted data in xlsx format.

To run the script, simply navigate to the repository directory and run the following command:

python scraper.py

The script will then start extracting data from the website based on the configuration settings and will save the data to the output.xlsx file.

Please note that the script is designed to work with yellow pages websites and may not work with other types of websites. Additionally, the script may be blocked by the website if it detects excessive scraping activity, so please use it responsibly.

If you have any issues or suggestions for improvements, please feel free to open an issue on the repository or submit a pull request.

Thank you for using the Yellowpage!

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
scrapers		scrapers
tools		tools
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
main.py		main.py
requirements.txt		requirements.txt
tempCodeRunnerFile.py		tempCodeRunnerFile.py
user-agents.txt		user-agents.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YellowPage-scraper

About

Languages

License

sushil-rgb/YellowPage-scraper

Folders and files

Latest commit

History

Repository files navigation

YellowPage-scraper

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages