JanLinders-Webscraper

Web scraper that collects information about products at Jan linders.

Gets its data from jan linders.

How to use

All information is uploaded to a MySQL database. You can either run a database server locally using a program like xampp, or get a company to host one for you.

Change your credentials and database name in the db.py file to your own login details.

The table that is automatically created in your database will look like this:

CREATE TABLE `products` (
  `name` varchar(100) NOT NULL,
  `price` double NOT NULL,
  `brand` varchar(45) NOT NULL,
  `weight` varchar(45) NOT NULL,
  `group` varchar(45) NOT NULL,
  PRIMARY KEY (`name`,`price`,`brand`,`weight`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

Ideas

Improve UI.
Show elapsed time when done.
Scrape offers page and store them in a seperate table.
Option to choose which groups to scrape.
Build test to see if the site structure is still the same.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

JanLinders-Webscraper

How to use

Ideas

Files

README.md

Latest commit

History

README.md

File metadata and controls

JanLinders-Webscraper

How to use

Ideas