In this scraping project, the site 'https://quotes.toscrape.com/' was scraped. The following actions were performed:
- We logged into the site by extracting the CSRF token from the login form and submitting it with the login request.
- We scraped the following data:
  - Quotes
  - Author
  - Tags
- We cleaned some of the quotes with a regex to remove certain Unicode characters.
- We extracted the data into a JSON file named 'quotes'.
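
The login step can be sketched roughly as follows, using only the Python standard library. This is an illustration, not the project's actual code: the hidden field name `csrf_token`, the form fields `username` and `password`, the `/login` path, and the helper names `extract_csrf_token` and `login` are all assumptions.

```python
import urllib.parse
import urllib.request
from html.parser import HTMLParser
from http.cookiejar import CookieJar

# Assumed login URL; adjust if the form lives elsewhere.
LOGIN_URL = "https://quotes.toscrape.com/login"


class CsrfTokenParser(HTMLParser):
    """Records the value of the hidden input named `csrf_token` (assumed name)."""

    def __init__(self):
        super().__init__()
        self.token = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "input" and attrs.get("name") == "csrf_token":
            self.token = attrs.get("value")


def extract_csrf_token(html):
    """Return the CSRF token found in the page, or None if there is none."""
    parser = CsrfTokenParser()
    parser.feed(html)
    return parser.token


def login(username, password):
    """Fetch the login form, read its token, and post it back with the credentials."""
    # The cookie jar keeps the session cookie between the GET and the POST.
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(CookieJar())
    )
    form_html = opener.open(LOGIN_URL).read().decode("utf-8")
    token = extract_csrf_token(form_html)
    if token is None:
        raise ValueError("no csrf_token field found in the login form")
    payload = urllib.parse.urlencode(
        {"csrf_token": token, "username": username, "password": password}
    ).encode("utf-8")
    opener.open(LOGIN_URL, data=payload)
    return opener  # reuse this opener for requests that need the session
```

Calling `login("user", "pass")` would return an opener that carries the session cookie, so later page requests made through it are sent as the logged-in user.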
Keep in mind that the site did not load data dynamically, and the parsed data was not stored in any database (SQL or NoSQL).
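
The cleaning and export steps might look something like the sketch below. Which Unicode characters were removed is not stated, so the pattern here assumes it was the curly quotation marks that wrap each quote. The record keys, the helper names, and the output path `quotes.json` are likewise illustrative assumptions.

```python
import json
import re

# Assumption: the characters to strip are the curly double quotation marks.
CURLY_QUOTES = re.compile(r"[\u201c\u201d]")


def clean_text(text):
    """Remove curly quotation marks and surrounding whitespace."""
    return CURLY_QUOTES.sub("", text).strip()


def build_record(text, author, tags):
    """Bundle one scraped quote into a JSON-serialisable dict."""
    return {"quote": clean_text(text), "author": author, "tags": list(tags)}


def save_quotes(records, path="quotes.json"):
    """Write all records to a single JSON file, keeping non-ASCII text readable."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)
```

Since nothing is written to a database, a single `save_quotes(records)` call at the end of the run is enough to persist the results.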