Skip to content

Noureldin2303/Web-scraping-using-Multithreading

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web-scraping

To extract data using web scraping with python using multithreading:

1- Find the URL that you want to scrape

2- Inspecting the Page

3- Find the data you want to extract

4- Write the code

5- Run the code and extract the data

6- Store the data in the required format

Download Browser driver is using

Chrome: https://sites.google.com/chromium.org/driver/



  • Importing packages: from selenium import webdriver
  • Create Chrome driver Instance: driver = webdriver.Chrome(r'Path in your computer where you have installed chromedriver')
  • Fetch webpage: driver.get('URL')
  • Parse webpage using Xpath: Data = driver.find_elements(By.XPATH,‘Xpath’)